Abstract

The Riemannian metric on the manifold of positive definite matrices is defined by a
kernel function $\phi$ in the form
$K_D^\phi(H,K) = \sum_{i,j} \phi(\lambda_i,\lambda_j)^{-1}\,\mathrm{Tr}\,P_iHP_jK$ when $\sum_i \lambda_iP_i$ is the
spectral decomposition of the foot point $D$ and the Hermitian matrices $H, K$ are tangent
vectors. For such kernel metrics the tangent space has an orthogonal decomposition. The
pull-back of a kernel metric under a mapping $D \mapsto G(D)$ is a kernel metric as well. Several
Riemannian geometries of the literature are particular cases, for example, the Fisher-Rao
metric for multivariate Gaussian distributions and the quantum Fisher information. In the
paper the case $\phi(x,y) = M(x,y)^\theta$ is mostly studied when $M(x,y)$ is a mean of the positive
numbers $x$ and $y$. There are results about the geodesic curves and geodesic distances. The
geometric mean, the logarithmic mean and the root mean are important cases.

Introduction

The $n \times n$ positive definite matrices with complex entries can be parametrized by the real and
imaginary parts of the entries, and they form an open subset of the space $\mathbb{H}_n$ of $n \times n$ Hermitian
matrices regarded as the Euclidean space $\mathbb{R}^m$, where $m = n^2$. Hence the tangent space of their
manifold $\mathbb{P}_n$ at any foot point can be identified with $\mathbb{H}_n$. A Riemannian metric $K_D(H,K)$ is a
family of inner products on $\mathbb{H}_n$ depending smoothly on the foot point $D$. If $\phi(x,y)$ is a positive
kernel function on $(0,\infty) \times (0,\infty)$ and $D$ has the spectral decomposition $\sum_{i=1}^k \lambda_iP_i$, then a
Riemannian metric can be defined as

$$K_D^\phi(H,K) := \sum_{i,j=1}^k \phi(\lambda_i,\lambda_j)^{-1}\,\mathrm{Tr}\,P_iHP_jK, \tag{0.1}$$

where $\mathrm{Tr}$ is the usual trace on matrices. The goal of the present paper is to study this kind of
Riemannian metric.

As far as the authors know, the first example of (0.1) is historically the case $\phi(x,y) = xy$,
which was considered by Skovgaard [36] as a Fisher-Rao statistical Riemannian metric on
positive definite matrices describing multivariate Gaussian distributions. Another example is
also related to Fisher information. In the quantum mechanical setting the states correspond
to positive semidefinite matrices of trace 1, and in [30, 34] the metric (0.1) was justified in the
particular case $\phi(x,y) = yf(x/y)$, where $f : (0,\infty) \to (0,\infty)$ is an operator monotone function.
More details on these examples are presented in the rest of this section.

The trivial choice $\phi(x,y) = 1$ gives a flat space where the Riemannian metric is the
Hilbert-Schmidt inner product $\langle H,K\rangle_{\mathrm{HS}}$ on $\mathbb{H}_n$. The Hilbert-Schmidt inner product
$\langle X,Y\rangle_{\mathrm{HS}} := \mathrm{Tr}\,X^*Y$ and the Hilbert-Schmidt norm $\|X\|_{\mathrm{HS}} := (\mathrm{Tr}\,X^*X)^{1/2}$ are defined on the
space $\mathbb{M}_n$ of all $n \times n$ complex matrices, and the space $(\mathbb{H}_n, \langle\cdot,\cdot\rangle_{\mathrm{HS}})$ is a real subspace of the
Hilbert space $(\mathbb{M}_n, \langle\cdot,\cdot\rangle_{\mathrm{HS}})$.

The positive definite real matrices might be considered as the variances of multivariate normal
distributions, and the information geometry of Gaussians yields a natural Riemannian metric.
The simplest way to construct an information geometry is to start with an information potential
function and to introduce the Riemannian metric by the Hessian of the potential. We want a
geometry on the family of non-degenerate multivariate Gaussian distributions with zero mean
vector. Each such distribution is determined by a positive definite real matrix $D$, and we write
$p_D$ for its density.

We identify the Gaussian $p_D$ with the matrix $D$, and we can say that the Riemannian geometry
is constructed on the space of positive definite real matrices. There are many reasons (originating
from statistical mechanics, information theory and mathematical statistics) that the Boltzmann
entropy

$$S(p_D) := -\log(\det D) + \mathrm{const.}$$

is a candidate for being an information potential.

The $n \times n$ real symmetric matrices can be identified with the Euclidean space of dimension
$n(n+1)/2$, and the positive definite matrices form an open subset. Therefore the set of Gaussians
has a simple and natural manifold structure. The tangent space at each foot point is the set
of symmetric matrices. The Riemannian metric is defined as the Hessian

$$g_D(H,K) := \frac{\partial^2}{\partial s\,\partial t}\,S(p_{D+sH+tK})\Big|_{s=t=0},$$

where $H$ and $K$ are tangents at $D$. The differentiation easily gives

$$g_D(H,K) = \mathrm{Tr}\,D^{-1}HD^{-1}K. \tag{0.2}$$

The corresponding information geometry of the Gaussians was discussed in [27] in detail. In the
statistical model of multivariate Gaussian distributions, (0.2) plays the role of the Fisher-Rao
metric. We note here that this geometry has many symmetries. Each congruence transformation
of the matrices becomes a symmetry, namely

$$g_{TDT^t}(THT^t, TKT^t) = g_D(H,K) \tag{0.3}$$

for every real invertible matrix $T$.

Formula (0.2) determines a Riemannian metric on the manifold $\mathbb{P}_n$ as well, and below we
prefer to consider the complex case. Note that if we want to find the geodesic curve between
$A$ and $B$, then it is sufficient to find the geodesic joining $I$ and $A^{-1/2}BA^{-1/2}$ due to property
(0.3). This is essentially easier since they commute. In fact, concerning the geodesic curves in
the Riemannian manifold $(\mathbb{P}_n, g)$, it is known [23, 26, 9] that for each $A, B \in \mathbb{P}_n$ there exists a
unique geodesic shortest curve joining $A, B \in \mathbb{P}_n$ given by

$$\gamma(t) = A \#_t B := A^{1/2}(A^{-1/2}BA^{-1/2})^tA^{1/2}, \qquad 0 \le t \le 1, \tag{0.4}$$

and the geodesic midpoint $\gamma(1/2)$ is just the geometric mean $A \# B$ ([35, 1]).
Furthermore, the geodesic distance is

$$\delta(A,B) = \|\log(A^{-1/2}BA^{-1/2})\|_{\mathrm{HS}}. \tag{0.5}$$

In this way, the information Riemannian geometry is adequate to treat the geometric mean of
positive definite matrices.
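
The formulas (0.2) to (0.5) are easy to check numerically. The following is only a minimal sketch for real symmetric matrices, assuming NumPy; the helper names (`spd_fun`, `fisher_rao`, `geodesic`, `distance`, `random_spd`) and the random test data are illustrative choices and do not come from the paper.

```python
import numpy as np

def spd_fun(A, f):
    """Apply a scalar function f to a symmetric positive definite matrix A."""
    w, V = np.linalg.eigh(A)
    return (V * f(w)) @ V.T

def fisher_rao(D, H, K):
    """g_D(H, K) = Tr D^{-1} H D^{-1} K, as in (0.2)."""
    Dinv = np.linalg.inv(D)
    return np.trace(Dinv @ H @ Dinv @ K)

def geodesic(A, B, t):
    """A #_t B = A^{1/2} (A^{-1/2} B A^{-1/2})^t A^{1/2}, as in (0.4)."""
    Ah = spd_fun(A, np.sqrt)
    Aih = spd_fun(A, lambda w: 1.0 / np.sqrt(w))
    return Ah @ spd_fun(Aih @ B @ Aih, lambda w: w ** t) @ Ah

def distance(A, B):
    """delta(A, B) = ||log(A^{-1/2} B A^{-1/2})||_HS, as in (0.5)."""
    Aih = spd_fun(A, lambda w: 1.0 / np.sqrt(w))
    return np.linalg.norm(spd_fun(Aih @ B @ Aih, np.log), "fro")

def random_spd(n, rng):
    """Hypothetical test data: a random symmetric positive definite matrix."""
    X = rng.standard_normal((n, n))
    return X @ X.T + n * np.eye(n)

rng = np.random.default_rng(0)
n = 4
A, B = random_spd(n, rng), random_spd(n, rng)
H = rng.standard_normal((n, n))
H = H + H.T                                        # symmetric tangent vector
T = np.eye(n) + 0.1 * rng.standard_normal((n, n))  # a generic invertible matrix

mid = geodesic(A, B, 0.5)                          # geometric mean A # B
print(distance(A, mid), distance(A, B) / 2)        # the two values should agree
print(fisher_rao(A, H, H),
      fisher_rao(T @ A @ T.T, T @ H @ T.T, T @ H @ T.T))  # invariance (0.3)
```

The midpoint check uses the fact that $G = A \# B$ solves $GA^{-1}G = B$, and the last line illustrates the congruence invariance (0.3).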

For each $A, B, C \in \mathbb{P}_n$ the mean $C' := A \# B$ is the midpoint of the geodesic joining $A$ and $B$;
$A' := B \# C$ and $B' := C \# A$ are similar. Since $\delta(B \# C, C \# A) \le \frac{1}{2}\delta(A,B)$ by [3, Proposition
6], the diameter of the triangle $A'B'C'$ is at most half of the diameter of $ABC$. This result
gives a geometric proof of the recursive construction of the geometric mean of 3 positive matrices
proposed in the literature. Note that another "geometric mean" of $A_1, \ldots, A_k \in \mathbb{P}_n$ was introduced in
[26, 9] as the unique minimizer of $A \in \mathbb{P}_n \mapsto \sum_{j=1}^k \delta^2(A, A_j)$.

We denote by $\mathbb{D}_n$ the set of all $n \times n$ positive definite matrices of trace 1, which is a smooth
differentiable manifold as a submanifold of $\mathbb{P}_n$. The tangent space of the manifold $\mathbb{D}_n$ at each
foot point $D$ is the subspace of $\mathbb{H}_n$ consisting of $n \times n$ Hermitian matrices of trace 0, i.e.,
$T_D\mathbb{D}_n = \mathbb{H}_n^0 := \{H \in \mathbb{H}_n : \mathrm{Tr}\,H = 0\}$. One can define a Riemannian metric on $\mathbb{D}_n$ in the
form

$$K_D(H,K) = \langle H, \Omega_D K\rangle_{\mathrm{HS}}, \qquad H, K \in \mathbb{H}_n^0,$$

where $\Omega_D$ is a positive linear operator on the real Hilbert space $(\mathbb{H}_n^0, \langle\cdot,\cdot\rangle_{\mathrm{HS}})$. One can
extend $\Omega_D$ to a positive symmetric operator on $\mathbb{H}_n$ and furthermore to a positive operator on
the Hilbert space $(\mathbb{M}_n, \langle\cdot,\cdot\rangle_{\mathrm{HS}})$ by complexification. So we may assume that a Riemannian
metric $K_D$ is given on $\mathbb{D}_n$, $n \in \mathbb{N}$, by $K_D(X,Y) = \langle X, \Omega_D Y\rangle_{\mathrm{HS}}$ for $X, Y \in \mathbb{M}_n$. The metric
$K_D$ (more precisely, a sequence of metrics $K_D$ on $\mathbb{D}_n$, $n \in \mathbb{N}$) is monotone if for any completely
positive and trace preserving map (or coarse graining) $\beta : \mathbb{M}_n \to \mathbb{M}_m$ we have

$$K_{\beta(D)}(\beta(X),\beta(X)) \le K_D(X,X), \qquad D \in \mathbb{D}_n,\ X \in \mathbb{M}_n.$$

Recall that $\beta$ is completely positive and trace preserving if and only if $\beta^*$ is completely positive
and unital. It was proved in Petz [30] that the monotone metrics with normalization
$K_D(I,I) = \mathrm{Tr}\,(D^{-1})$ correspond one-to-one to the operator monotone functions $f : (0,\infty) \to
(0,\infty)$ with normalization $f(1) = 1$ as follows:

$$K_D^f(X,Y) := \langle X, (\mathbb{J}_D^f)^{-1}Y\rangle_{\mathrm{HS}} \quad\text{and}\quad \mathbb{J}_D^f := f(\mathbb{L}_D\mathbb{R}_D^{-1})\mathbb{R}_D. \tag{0.6}$$

Furthermore, $K_D^f$ is symmetric if and only if $f$ is symmetric, i.e., $xf(x^{-1}) = f(x)$, $x > 0$.
We say that an operator monotone function $f > 0$ on $(0,\infty)$ is standard if $f(1) = 1$ and
$xf(x^{-1}) = f(x)$.

On the other hand, the theory of operator means due to Kubo and Ando [22] says that there
is a one-to-one correspondence between the symmetric operator means (or matrix means) and
the standard operator monotone functions $f$ as follows:

$$\sigma_f(A,B) := A^{1/2}f(A^{-1/2}BA^{-1/2})A^{1/2}, \qquad A, B \in \mathbb{P}_n.$$

Thus one may write

$$K_D^f(X,Y) = \langle X, \sigma_f(\mathbb{L}_D,\mathbb{R}_D)^{-1}Y\rangle_{\mathrm{HS}}. \tag{0.7}$$

When $D = \mathrm{Diag}(\lambda_1, \ldots, \lambda_n)$ is diagonal, one can more explicitly write

$$K_D^f(X,Y) = \sum_{i,j=1}^n \frac{1}{\sigma_f(\lambda_i,\lambda_j)}\,\overline{X_{ij}}\,Y_{ij}.$$

For each standard operator monotone function $f$, the symmetric monotone metric (or the
quantum Fisher information) $K_D^f$ originally defined on $\mathbb{D}_n$ by (0.6) or (0.7) can be automatically
extended to $\mathbb{P}_n$ by the same formula.

It was also observed in Lesniewski and Ruskai [24] that any of the above metrics $K^f$ can
be realized as the Hessian

$$K_D^f(H,K) = -\frac{\partial^2}{\partial s\,\partial t}\,S_F(D + sH, D + tK)\Big|_{s=t=0}$$

of a quasi-entropy [28, 29] $S_F(D_1, D_2)$ defined by a function $F$ on $(0,\infty)$ with the relation
$1/f(x) = (F(x) + xF(x^{-1}))/(x-1)^2$.

The Wigner-Yanase-Dyson skew information is the quantity

$$I_p^{\mathrm{WYD}}(D,K) := -\frac{1}{2}\,\mathrm{Tr}\,[D^p,K][D^{1-p},K], \qquad D \in \mathbb{P}_n,\ K \in \mathbb{H}_n,$$

where $0 < p < 1$. The case $p = 1/2$ is the original Wigner-Yanase skew information. It was
observed in [33] that the Wigner-Yanase-Dyson skew information $I_p^{\mathrm{WYD}}(D,K)$ coincides, apart
from a constant factor, with a monotone Riemannian metric

$$K_D^{f_p}(\mathrm{i}[D,K], \mathrm{i}[D,K]),$$

where $f_p$ is a standard operator monotone function defined by

$$f_p(x) := p(1-p)\,\frac{(x-1)^2}{(x^p-1)(x^{1-p}-1)}. \tag{0.8}$$

The notion of skew information was recently generalized by Hansen [13] as follows: For each
standard operator monotone function $f$ that is regular, i.e., $f(0)\ (:= \lim_{x \searrow 0} f(x)) > 0$, the
metric adjusted skew information (or the quantum skew information) corresponding to $f$ is

$$I_D^f(K) := \frac{f(0)}{2}\,K_D^f(\mathrm{i}[D,K], \mathrm{i}[D,K]), \qquad D \in \mathbb{P}_n,\ K \in \mathbb{H}_n, \tag{0.9}$$

which is explicitly written as

$$I_D^f(K) = \frac{f(0)}{2}\sum_{i,j=1}^n \frac{(\lambda_i-\lambda_j)^2}{\lambda_j f(\lambda_i/\lambda_j)}\,|K_{ij}|^2 \qquad\text{if } D = \mathrm{Diag}(\lambda_1, \ldots, \lambda_n).$$

Via the operator $\mathbb{J}_D^f$ in (0.6), each standard operator monotone function $f$ defines a quantity

$$\varphi_D^f[K,K] := \langle K, \mathbb{J}_D^f K\rangle_{\mathrm{HS}}, \qquad D \in \mathbb{P}_n,\ K \in \mathbb{H}_n, \tag{0.10}$$

which was called generalized variance in [31]. Any such variance has the property $\varphi_D^f[K,K] =
\mathrm{Tr}\,DK^2$ for commuting $D$ and $K$.

In the present paper we study Riemannian geometry on $\mathbb{P}_n$ with kernel metrics $K^\phi$ in (0.1)
when the kernel function $\phi(x,y)$ is in the form $M(x,y)^\theta$, a degree $\theta \in \mathbb{R}$ power of a certain
mean $M(x,y)$ for two positive numbers (as prescribed at the beginning of Section 2). The
above quantities (0.2), (0.6) and (0.10) are important special cases where $\theta = 2$, $1$ and $-1$,
respectively. The paper is organized as follows. After describing our setting in Section 1 in
more detail, in Section 2 we determine Riemannian metrics in our class which are written
as a pull-back of the Euclidean metric. For such metrics the geodesic curve and the geodesic
distance are explicitly given (Theorem 2.1). Section 3 is concerned with the (non-)completeness
of Riemannian metrics in our class (Theorem 3.1) and pull-back metrics from the Fisher-Rao
metric $g$ (Theorem 3.3). In Section 4 we discuss comparison properties among our Riemannian
metrics. The comparison of geodesic distances for two metrics is easily described in terms of the
corresponding means and the degrees of power (Theorem 4.1). Finally in Section 5, we treat
the generalized situation (of Finsler metrics rather than Riemannian metrics) where unitarily
invariant norms are applied in place of the Hilbert-Schmidt norm.

For basics on Riemannian geometry the reader may refer to texts [18, 25] for example.

1 Riemannian metrics induced by kernel functions 

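For each $D \in \mathbb{P}_n$ the left and right multiplication operators $\mathbb{L}_D$ and $\mathbb{R}_D$ are defined as $\mathbb{L}_DX :=
DX$ and $\mathbb{R}_DX := XD$ for $X \in \mathbb{M}_n$. Note that $\mathbb{L}_D$ and $\mathbb{R}_D$ are commuting positive operators on
the Hilbert space $(\mathbb{M}_n, \langle\cdot,\cdot\rangle_{\mathrm{HS}})$, i.e., $\mathbb{L}_D\mathbb{R}_D = \mathbb{R}_D\mathbb{L}_D$, $\langle X, \mathbb{L}_DX\rangle_{\mathrm{HS}} \ge 0$ and $\langle X, \mathbb{R}_DX\rangle_{\mathrm{HS}} \ge 0$
for all $X \in \mathbb{M}_n$. For a kernel function $\phi : (0,\infty) \times (0,\infty) \to (0,\infty)$, a positive operator
$\phi(\mathbb{L}_D,\mathbb{R}_D)$ on $(\mathbb{M}_n, \langle\cdot,\cdot\rangle_{\mathrm{HS}})$ is defined via functional calculus, that is, when $D = \sum_{i=1}^k \lambda_iP_i$ is
the spectral decomposition,

$$\phi(\mathbb{L}_D,\mathbb{R}_D)X := \sum_{i,j=1}^k \phi(\lambda_i,\lambda_j)\,P_iXP_j.$$

When $\phi(x,y)$ is smooth in $x$ and $y$, one can define a Riemannian metric $K^\phi$ on $\mathbb{P}_n$ by

$$K_D^\phi(H,K) := \langle H, \phi(\mathbb{L}_D,\mathbb{R}_D)^{-1}K\rangle_{\mathrm{HS}} = \sum_{i,j=1}^k \phi(\lambda_i,\lambda_j)^{-1}\,\mathrm{Tr}\,P_iHP_jK \tag{1.1}$$

when $H, K \in \mathbb{H}_n$.

By taking the diagonalization $D = U\,\mathrm{Diag}(\lambda_1, \ldots, \lambda_n)U^*$ with a unitary $U$, one can also
write

$$\phi(\mathbb{L}_D,\mathbb{R}_D)^{-1/2}H = U\left(\left[\frac{1}{\sqrt{\phi(\lambda_i,\lambda_j)}}\right]_{ij} \circ (U^*HU)\right)U^*, \tag{1.2}$$

where $\circ$ denotes the Schur (or Hadamard) product.

Lemma 1.1. For each $D \in \mathbb{P}_n$ let

$$T_D^c := \{H \in \mathbb{H}_n : HD = DH\} \quad\text{and}\quad T_D^q := \{\mathrm{i}[D,K] : K \in \mathbb{H}_n\}.$$

Then

(1) $K_D^\phi(H,K) = \mathrm{Tr}\,\tilde\phi(D)HK$ if $H \in T_D^c$ and $K \in \mathbb{H}_n$, where $\tilde\phi(x) := 1/\phi(x,x)$, $x > 0$.

(2) $K_D^\phi(H, \mathrm{i}[D,K]) = 0$ if $H \in T_D^c$ and $K \in \mathbb{H}_n$.

(3) $K_D^\phi(\mathrm{i}[D,K], \mathrm{i}[D,K]) = \langle K, \hat\phi(\mathbb{L}_D,\mathbb{R}_D)K\rangle_{\mathrm{HS}}$ for all $K \in \mathbb{H}_n$, where

$$\hat\phi(x,y) := \frac{(x-y)^2}{\phi(x,y)}, \qquad x, y > 0.$$

In particular, the tangent space $T_D = \mathbb{H}_n$ has an orthogonal decomposition $T_D = T_D^c \oplus T_D^q$ with
respect to $K_D^\phi$.

The proof of the lemma, which is easy by using (1.1), is left to the reader.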
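
As an illustration of (1.1), (1.2) and Lemma 1.1, the kernel metric can be evaluated in an eigenbasis of $D$ as a weighted entrywise sum. The sketch below assumes NumPy and a symmetric kernel $\phi$; the function names and the three sample kernels (`flat`, `fisher`, `root`) are hypothetical choices made only for this example.

```python
import numpy as np

def kernel_metric(phi, D, H, K):
    """<H, phi(L_D, R_D)^{-1} K>_HS for a symmetric kernel phi, as in (1.1).

    In an eigenbasis of D this is the Schur-product expression
    sum_{i,j} conj(H'_ij) K'_ij / phi(l_i, l_j), cf. (1.2).
    """
    lam, U = np.linalg.eigh(D)
    Hp = U.conj().T @ H @ U
    Kp = U.conj().T @ K @ U
    W = 1.0 / phi(lam[:, None], lam[None, :])
    return float(np.real(np.sum(W * Hp.conj() * Kp)))

def random_hermitian(n, rng):
    """Hypothetical test data: a random Hermitian matrix."""
    X = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
    return (X + X.conj().T) / 2

flat = lambda x, y: np.ones_like(x * y)                    # phi = 1
fisher = lambda x, y: x * y                                # phi(x, y) = xy
root = lambda x, y: ((np.sqrt(x) + np.sqrt(y)) / 2) ** 2   # root mean, degree 1

rng = np.random.default_rng(1)
n = 4
X = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
D = X @ X.conj().T + n * np.eye(n)           # positive definite foot point
H, K = random_hermitian(n, rng), random_hermitian(n, rng)
Dinv = np.linalg.inv(D)

Hc = D @ D + 2 * D                           # an element commuting with D
comm = 1j * (D @ K - K @ D)                  # i[D, K]

print(kernel_metric(flat, D, H, K), np.trace(H @ K).real)
print(kernel_metric(fisher, D, H, K), np.trace(Dinv @ H @ Dinv @ K).real)
print(kernel_metric(root, D, Hc, comm))      # close to 0, Lemma 1.1 (2)
```

The first two lines of output compare the kernel metric with the Hilbert-Schmidt inner product and with (0.2); the last value illustrates the orthogonality of the two parts of the tangent space.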
When $\gamma : [0,1] \to \mathbb{P}_n$ is a $C^1$ curve (or more generally, a continuous and piecewise $C^1$ curve),
the length of $\gamma$ with respect to the metric $K^\phi$ is given by

$$L_\phi(\gamma) := \int_0^1 \sqrt{K_{\gamma(t)}^\phi(\gamma'(t),\gamma'(t))}\,dt = \int_0^1 \bigl\|\phi(\mathbb{L}_{\gamma(t)},\mathbb{R}_{\gamma(t)})^{-1/2}\gamma'(t)\bigr\|_{\mathrm{HS}}\,dt. \tag{1.3}$$

Note that the length $L_\phi(\gamma)$ is independent of the choice of the parametrization of $\gamma$. The
geodesic distance $\delta_\phi(A,B)$ between $A, B \in \mathbb{P}_n$ is the infimum of $L_\phi(\gamma)$ over all $C^1$ curves (or
equivalently, over all smooth curves) $\gamma$ from $A$ to $B$. A geodesic shortest curve is a curve $\gamma$ from
$A$ to $B$ such that $L_\phi(\gamma) = \delta_\phi(A,B)$.

Now let $G$ be a smooth function from an open interval $(a,b)$ into $(0,\infty)$. Assume that
$G'(x) \ne 0$ for all $x \in (a,b)$ so that $G$ is a diffeomorphism from $(a,b)$ onto a subinterval of
$(0,\infty)$. Let $\mathbb{H}_n(a,b)$ denote the submanifold $\{A \in \mathbb{H}_n : a < A < b\}$ of $\mathbb{H}_n$, where $a < A < b$
means that all the eigenvalues of $A$ are in $(a,b)$. Then the map $A \mapsto G(A)$ defined via functional
calculus is a smooth diffeomorphism from $\mathbb{H}_n(a,b)$ into $\mathbb{P}_n$. Our next aim is to determine a
Riemannian metric on $\mathbb{H}_n(a,b)$ such that $A \mapsto G(A)$ is an isometry into the Riemannian
space $(\mathbb{P}_n, K^\phi)$. This Riemannian metric on $\mathbb{H}_n(a,b)$ is called the pull-back of $K^\phi$ under the
transformation $A \mapsto G(A)$.

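Lemma 1.2. Let $K_A$, $A \in \mathbb{H}_n(a,b)$, be the pull-back of the Riemannian metric $K^\phi$ on $\mathbb{P}_n$
under $A \mapsto G(A)$ as mentioned above. Let $A \in \mathbb{H}_n(a,b)$ and let $A = \sum_{i=1}^k \lambda_iP_i$ be the spectral
decomposition. Furthermore, let $T_A^c := \{H \in \mathbb{H}_n : HA = AH\}$ as in Lemma 1.1. Then

(1) $\frac{d}{dt}G(A + tH)\big|_{t=0} = G'(A)H$ if $H \in T_A^c$.

(2) $\frac{d}{dt}G(A + t\,\mathrm{i}[A,K])\big|_{t=0} = \mathrm{i}[G(A),K]$ for all $K \in \mathbb{H}_n$.

(3) For every $H \in T_A^c$,

$$K_A(H,H) = \sum_{i=1}^k \frac{G'(\lambda_i)^2}{\phi(G(\lambda_i),G(\lambda_i))}\,\mathrm{Tr}\,P_iH^2.$$

(4) For every $H \in T_A^c$ and $K \in \mathbb{H}_n$, $K_A(H, \mathrm{i}[A,K]) = 0$.

(5) For every $K \in \mathbb{H}_n$,

$$K_A(\mathrm{i}[A,K], \mathrm{i}[A,K]) = \sum_{i,j=1}^k \frac{(G(\lambda_i)-G(\lambda_j))^2}{\phi(G(\lambda_i),G(\lambda_j))}\,\mathrm{Tr}\,P_iKP_jK.$$

Proof. (1) is obvious.

(2) This is found in [32], but a short proof using the differential formula (see [6]) is given
here. We may assume without loss of generality that $A$ is diagonal as $A = \mathrm{Diag}(a_1, \ldots, a_n)$.
With the Fréchet derivative $DG(A) : \mathbb{H}_n \to \mathbb{H}_n$ of $G$ at $A$, for $K = [K_{ij}]$ we have

$$\frac{d}{dt}G(A + t\,\mathrm{i}[A,K])\Big|_{t=0} = DG(A)(\mathrm{i}[A,K]) = \left[\frac{G(a_i)-G(a_j)}{a_i-a_j}\,\mathrm{i}(a_i-a_j)K_{ij}\right]
= \mathrm{i}\bigl[(G(a_i)-G(a_j))K_{ij}\bigr] = \mathrm{i}[G(A),K].$$

(3) By the isometry property together with the above (1) and Lemma 1.1 (1) we get

$$K_A(H,H) = K_{G(A)}^\phi(G'(A)H, G'(A)H) = \mathrm{Tr}\,\tilde\phi(G(A))G'(A)^2H^2
= \sum_{i=1}^k \frac{G'(\lambda_i)^2}{\phi(G(\lambda_i),G(\lambda_i))}\,\mathrm{Tr}\,P_iH^2.$$

(4) By the isometry property together with the above (1), (2) and Lemma 1.1 (2) we get

$$K_A(H, \mathrm{i}[A,K]) = K_{G(A)}^\phi(G'(A)H, \mathrm{i}[G(A),K]) = 0.$$

(5) Similarly, by Lemma 1.1 (3),

$$K_A(\mathrm{i}[A,K], \mathrm{i}[A,K]) = K_{G(A)}^\phi(\mathrm{i}[G(A),K], \mathrm{i}[G(A),K])
= \sum_{i,j=1}^k \frac{(G(\lambda_i)-G(\lambda_j))^2}{\phi(G(\lambda_i),G(\lambda_j))}\,\mathrm{Tr}\,P_iKP_jK.$$

□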



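In particular, let $G$ be a smooth function from $(0,\infty)$ into $(0,\infty)$ such that $G'(x) \ne 0$ for
all $x > 0$. Let $G^{[1]}(x,y)$ be the divided difference of $G$, i.e.,

$$G^{[1]}(x,y) := \begin{cases} \dfrac{G(x)-G(y)}{x-y} & \text{if } x \ne y, \\[1ex] G'(x) & \text{if } x = y. \end{cases}$$

Then, from Lemmas 1.1 and 1.2 we arrive at the following result.

Theorem 1.3. The pull-back of the kernel metric $K^\phi$ under the mapping $D \in \mathbb{P}_n \mapsto G(D) \in \mathbb{P}_n$
is a kernel metric corresponding to the function

$$\psi(x,y) := \frac{\phi(G(x),G(y))}{G^{[1]}(x,y)^2}, \qquad x, y > 0.$$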
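
A small numerical sanity check of the pull-back statement may be helpful. The sketch below assumes NumPy, real symmetric matrices, and the illustrative choice $G(x) = x^2$, for which the divided difference is $x + y$ and the derivative of $D \mapsto D^2$ in the direction $H$ is $DH + HD$; the function names are hypothetical and the kernel $\phi(G(x),G(y))/G^{[1]}(x,y)^2$ is the one obtained from Lemma 1.2 (3) and (5).

```python
import numpy as np

def kernel_metric(phi, D, H, K):
    """K_D^phi(H, K) for real symmetric D, H, K and a symmetric kernel phi."""
    lam, U = np.linalg.eigh(D)
    Hp, Kp = U.T @ H @ U, U.T @ K @ U
    return float(np.sum(Hp * Kp / phi(lam[:, None], lam[None, :])))

def pullback_kernel(phi, G, G1):
    """Kernel phi(G(x), G(y)) / G^{[1]}(x, y)^2 of the pulled-back metric."""
    return lambda x, y: phi(G(x), G(y)) / G1(x, y) ** 2

# Illustrative choice (an assumption of this sketch): G(x) = x^2.
G = lambda x: x ** 2
G1 = lambda x, y: x + y            # divided difference of G
fisher = lambda x, y: x * y        # kernel of the Fisher-Rao metric

rng = np.random.default_rng(2)
n = 4
X = rng.standard_normal((n, n))
D = X @ X.T + n * np.eye(n)        # positive definite foot point
H = rng.standard_normal((n, n))
H = H + H.T                        # symmetric tangent vector
dG = D @ H + H @ D                 # derivative of D -> D^2 in the direction H

lhs = kernel_metric(fisher, D @ D, dG, dG)                    # metric at G(D)
rhs = kernel_metric(pullback_kernel(fisher, G, G1), D, H, H)  # pulled-back kernel at D
print(lhs, rhs)                    # the two values should agree
```

With the root mean kernel in place of `fisher`, the pulled-back kernel is the constant $1/4$, so squaring maps that metric to four times the Euclidean one.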
2 Pull-back metrics from the Euclidean metric

We are concerned with the Riemannian metric $K^\phi$ related to a kernel function $\phi$ which is a
power of a certain mean for two positive numbers. As in [14] a symmetric homogeneous mean
is a function $M : (0,\infty) \times (0,\infty) \to (0,\infty)$ such that for every $x, y > 0$,

(1) $M(x,y) = M(y,x)$,

(2) $M(\alpha x, \alpha y) = \alpha M(x,y)$ for all $\alpha > 0$,

(3) $M(x,y)$ is non-decreasing in $x, y$,

(4) $\min\{x,y\} \le M(x,y) \le \max\{x,y\}$.

The above mean $M$ is determined by a single variable function $M(x,1)$ since $M(x,y) =
yM(x/y,1)$. The set of all symmetric homogeneous means was denoted by $\mathfrak{M}$ in [14], so
in this paper we denote by $\mathfrak{M}_0$ the set of all smooth symmetric homogeneous means. Here a
symmetric homogeneous mean $M(x,y)$ is smooth if so is $M(x,1)$. This means that $M(x,y)$ is
smooth in $x, y > 0$.

In the rest of the paper we always assume $n \ge 2$ since the situation is trivial when $n = 1$.
We assume that $\phi$ is a power of an $M \in \mathfrak{M}_0$ with degree $\theta \in \mathbb{R}$, i.e., $\phi(x,y) := M(x,y)^\theta$. The
aim of this section is to determine when the Riemannian metric $K^\phi$ derived from $M$ and $\theta$ is
a pull-back of the Euclidean metric. We are interested in this problem because the geodesic
shortest path in that case is explicitly written as the pull-back of a segment in the Euclidean
space.

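Theorem 2.1. Let $M \in \mathfrak{M}_0$, $\theta \in \mathbb{R}$ with $\theta \ne 0$ and $\phi(x,y) := M(x,y)^\theta$. Assume that $F$ is a
smooth function from $(0,\infty)$ into $\mathbb{R}$ such that $F'(x) \ne 0$ for all $x > 0$. Then the transformation
$D \in \mathbb{P}_n \mapsto F(D) \in \mathbb{H}_n$ is isometric from $(\mathbb{P}_n, K^\phi)$ into the Euclidean manifold $(\mathbb{H}_n, \|\cdot\|_{\mathrm{HS}})$ if
and only if

$$F(x) = \begin{cases} \pm\dfrac{2}{2-\theta}\,x^{\frac{2-\theta}{2}} & \text{if } \theta \ne 2, \\[1ex] \pm\log x & \text{if } \theta = 2 \end{cases} \tag{2.1}$$

(up to a constant $c$) and

$$M(x,y) = \begin{cases} \left(\dfrac{2-\theta}{2}\cdot\dfrac{x-y}{x^{\frac{2-\theta}{2}}-y^{\frac{2-\theta}{2}}}\right)^{2/\theta} & \text{if } \theta \ne 2, \\[2ex] \dfrac{x-y}{\log x-\log y} & \text{if } \theta = 2. \end{cases} \tag{2.2}$$

Moreover, in this case, for every $A, B \in \mathbb{P}_n$ a unique geodesic shortest curve from $A$ to $B$
is given by

$$\gamma(t) = \begin{cases} \bigl((1-t)A^{\frac{2-\theta}{2}} + tB^{\frac{2-\theta}{2}}\bigr)^{\frac{2}{2-\theta}} & \text{if } \theta \ne 0, 2, \\[1ex] \exp((1-t)\log A + t\log B) & \text{if } \theta = 2, \end{cases} \qquad 0 \le t \le 1,$$

and the geodesic distance between $A$ and $B$ is

$$\delta_\phi(A,B) = \begin{cases} \dfrac{2}{|2-\theta|}\,\bigl\|A^{\frac{2-\theta}{2}} - B^{\frac{2-\theta}{2}}\bigr\|_{\mathrm{HS}} & \text{if } \theta \ne 0, 2, \\[1ex] \|\log A - \log B\|_{\mathrm{HS}} & \text{if } \theta = 2. \end{cases} \tag{2.3}$$

Proof. Let $(a,b)$ be the range of $F$ (which must be an open interval by assumption) and
$G := F^{-1} : (a,b) \to (0,\infty)$ be the inverse of $F$. The stated property of isometric transformation
means that the pull-back of $K^\phi$ via $G$ is the Euclidean metric on the submanifold $\mathbb{H}_n(a,b)$ of
$\mathbb{H}_n$. From (3)-(5) of Lemma 1.2 one can easily see that this property is equivalent to the
following two conditions:

$$\frac{G'(t)^2}{\phi(G(t),G(t))} = 1, \qquad t \in (a,b),$$

$$\frac{(G(s)-G(t))^2}{\phi(G(s),G(t))} = (s-t)^2, \qquad s, t \in (a,b).$$

It is obvious that the above two are respectively equivalent to the following:

$$F'(x)^2 = x^{-\theta}, \qquad x > 0, \tag{2.4}$$

$$\frac{(x-y)^2}{\phi(x,y)} = (F(x)-F(y))^2, \qquad x, y > 0. \tag{2.5}$$

The differential equation (2.4) determines $F$ as (2.1), and this together with (2.5) determines
$M$ as (2.2).

The rest of the theorem immediately follows from the isometric transformation via $F$ in
(2.1). One may just note that the segment joining $H, K \in \mathbb{H}_n$ is a unique shortest path
between $H$ and $K$ in the Euclidean manifold $(\mathbb{H}_n, \|\cdot\|_{\mathrm{HS}})$. □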

In the following we present a somewhat more direct proof of Theorem 2.1. Formula (2.7) below
will also be useful in our discussions in the rest of the paper. Let $F$ and $G := F^{-1}$ be as
above. For each $C^1$ curve $\gamma : [0,1] \to \mathbb{P}_n$ we make a change of variable $\xi(t) := F(\gamma(t))$, hence
$\gamma(t) = G(\xi(t))$. We then have

$$K_{\gamma(t)}^\phi(\gamma'(t),\gamma'(t)) = \bigl\|\phi(\mathbb{L}_{\gamma(t)},\mathbb{R}_{\gamma(t)})^{-1/2}\gamma'(t)\bigr\|_{\mathrm{HS}}^2$$

and

$$\gamma'(t) = DG(\xi(t))(\xi'(t)),$$

where $DG(\xi(t)) : \mathbb{H}_n \to \mathbb{H}_n$ is the Fréchet derivative of $G$ at $\xi(t)$. Under the diagonalization
$\xi(t) = U\,\mathrm{Diag}(\lambda_1, \ldots, \lambda_n)U^*$ for each fixed $t \in [0,1]$, thanks to the differential formula (see [6])

$$DG(\xi(t))(\xi'(t)) = U\Bigl(\bigl[G^{[1]}(\lambda_i,\lambda_j)\bigr]_{ij} \circ (U^*\xi'(t)U)\Bigr)U^* = G^{[1]}(\mathbb{L}_{\xi(t)},\mathbb{R}_{\xi(t)})\xi'(t) \tag{2.6}$$

as well as (1.2), we obtain

$$\phi(\mathbb{L}_{\gamma(t)},\mathbb{R}_{\gamma(t)})^{-1/2}\gamma'(t) = \phi(\mathbb{L}_{G(\xi(t))},\mathbb{R}_{G(\xi(t))})^{-1/2}G^{[1]}(\mathbb{L}_{\xi(t)},\mathbb{R}_{\xi(t)})\xi'(t)
= U\left(\left[\frac{G^{[1]}(\lambda_i,\lambda_j)}{\sqrt{\phi(G(\lambda_i),G(\lambda_j))}}\right]_{ij} \circ (U^*\xi'(t)U)\right)U^*. \tag{2.7}$$

Hence we see that the metric $K^\phi$ on $\mathbb{P}_n$ is the pull-back of the Euclidean metric on $\mathbb{H}_n(a,b)$
via $F$ if and only if

$$\frac{G^{[1]}(s,t)}{\sqrt{\phi(G(s),G(t))}} = \pm 1 \tag{2.8}$$

for all $s, t \in (a,b)$, where the right-hand side of (2.8) is $1$ or $-1$ according to $G$ being increasing
or decreasing. Since $\phi(x,x) = x^\theta$, (2.8) for $s = t$ yields the differential equation

$$G'(t) = \pm G(t)^{\theta/2}, \qquad t \in (a,b).$$

This is equivalently written as $F'(x) = \pm x^{-\theta/2}$, $x > 0$, which is solved as (2.1). From (2.1) and
(2.8) we obtain (2.2). Thus we have proved Theorem 2.1 again. Note that one can prove the
theorem even more simply by appealing to the divided difference $F^{[1]}(x,y)$ of $F$.



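For $\theta \in \mathbb{R}$, $\theta \ne 0$, we write $M_\theta(x,y)$ for $M(x,y)$ given in (2.2) and $\phi_\theta(x,y)$ for $M_\theta(x,y)^\theta$.
The family of means interpolates the following typical means:

$$M_{-2}(x,y) = M_A(x,y) := \frac{x+y}{2} \quad\text{(arithmetic mean)}, \tag{2.9}$$

$$M_1(x,y) = M_{\surd}(x,y) := \left(\frac{\sqrt{x}+\sqrt{y}}{2}\right)^2 \quad\text{(root mean)}, \tag{2.10}$$

$$M_2(x,y) = M_L(x,y) := \frac{x-y}{\log x-\log y} \quad\text{(logarithmic mean)}, \tag{2.11}$$

$$M_4(x,y) = M_G(x,y) := \sqrt{xy} \quad\text{(geometric mean)}. \tag{2.12}$$

Furthermore, we may define $M_0(x,y)$ by taking the limit

$$M_0(x,y) := \lim_{\theta \to 0} M_\theta(x,y) = \frac{1}{e}\left(\frac{x^x}{y^y}\right)^{\frac{1}{x-y}} \quad\text{(identric mean)}, \tag{2.13}$$

and $\phi_0(x,y) = 1$. Note also that

$$\frac{x-y}{\log x-\log y} = \lim_{\theta \to 2} \frac{2-\theta}{2}\cdot\frac{x-y}{x^{\frac{2-\theta}{2}}-y^{\frac{2-\theta}{2}}}.$$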
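
The special cases (2.9) to (2.13) can be checked numerically. The following sketch assumes NumPy and scalar arguments $x \ne y$; it takes $F$ from (2.1) with the plus sign and additive constant $0$, and the function names are chosen only for this illustration.

```python
import numpy as np

def F(theta, x):
    """F of (2.1), taken with the plus sign and additive constant 0."""
    if theta == 2:
        return np.log(x)
    p = (2 - theta) / 2
    return x ** p / p

def mean_theta(theta, x, y):
    """M_theta(x, y) of (2.2), for theta != 0 and x != y."""
    return ((x - y) / (F(theta, x) - F(theta, y))) ** (2 / theta)

def identric(x, y):
    """Limit mean M_0 of (2.13), written via logarithms."""
    return np.exp((x * np.log(x) - y * np.log(y)) / (x - y) - 1)

x, y = 3.0, 0.7
for theta in (-2, 1, 2, 4):
    # arithmetic, root, logarithmic and geometric mean, respectively
    print(theta, mean_theta(theta, x, y))
print("theta -> 0:", mean_theta(1e-6, x, y), identric(x, y))
```

By construction `mean_theta` satisfies (2.5), so the printed values only confirm that the closed forms (2.9) to (2.13) match the general formula (2.2).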



As mentioned in the Introduction, monotone metrics ([30]) form a particularly important
class of Riemannian metrics. Those are the kernel metrics in the case where $\theta = 1$ and
$M(x,1)$ is operator monotone. In the case $\theta = 1$, the theorem says that the metric
corresponding to the root mean $M_{\surd}$ (that is a special case of binomial means [14]), called the
Wigner-Yanase metric, is a unique monotone metric which is a pull-back of the Euclidean
metric. This was in fact proved by Gibilisco and Isola [12] in a slightly different approach.
Other famous monotone metrics are the Bogoliubov metric (also called the Kubo-Mori metric)
corresponding to the logarithmic mean $M_L$ and the Bures-Uhlmann metric corresponding to
the arithmetic mean $M_A$.

In this way, we have found a one-parameter family $M_\theta \in \mathfrak{M}_0$, $\theta \in \mathbb{R}$, given in (2.2) and
(2.13). It is remarkable that this is a rather familiar family of means introduced in [37] with
a different parametrization and called Stolarsky means in [10, §2.6]. A monotonicity property
of the family was proved in [37], which we state in the next lemma for convenience of
reference.

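Lemma 2.2. ([37]) For every $x, y > 0$ with $x \ne y$, $M_\theta(x,y)$ is strictly decreasing in $\theta \in \mathbb{R}$.
Furthermore, $\lim_{\theta \to -\infty} M_\theta(x,y) = \max\{x,y\}$ and $\lim_{\theta \to \infty} M_\theta(x,y) = \min\{x,y\}$.

Next we are concerned with the relation among the metrics $K^\phi$ under the reflection map
$A \mapsto A^{-1}$.

Proposition 2.3. Let $M^{(1)}, M^{(2)} \in \mathfrak{M}_0$, $\theta_1, \theta_2 \in \mathbb{R}$ and $\phi^{(k)}(x,y) := M^{(k)}(x,y)^{\theta_k}$, $k = 1, 2$.
Then the Riemannian manifolds $(\mathbb{P}_n, K^{\phi^{(1)}})$ and $(\mathbb{P}_n, K^{\phi^{(2)}})$ are isometric under the reflection
$A \mapsto A^{-1}$ on $\mathbb{P}_n$ if and only if

$$\theta_1 + \theta_2 = 4 \quad\text{and}\quad \left(\frac{M^{(1)}(x,y)}{\sqrt{xy}}\right)^{\theta_1} = \left(\frac{M^{(2)}(x,y)}{\sqrt{xy}}\right)^{\theta_2}, \qquad x, y > 0.$$

In particular, if $\phi(x,y) = M(x,y)^2$ with an arbitrary $M \in \mathfrak{M}_0$, then $A \mapsto A^{-1}$ is an isometric
transformation on $(\mathbb{P}_n, K^\phi)$. Moreover, for every $\theta \in \mathbb{R}$, $(\mathbb{P}_n, K^{\phi_\theta})$ and $(\mathbb{P}_n, K^{\phi_{4-\theta}})$ are
isometric under $A \mapsto A^{-1}$.

Proof. If $\gamma$ is a $C^1$ curve in $\mathbb{P}_n$, then we have

$$\frac{d}{dt}\gamma(t)^{-1} = -\gamma(t)^{-1}\gamma'(t)\gamma(t)^{-1}.$$

Hence $A \mapsto A^{-1}$ gives an isometry between $(\mathbb{P}_n, K^{\phi^{(2)}})$ and $(\mathbb{P}_n, K^{\phi^{(1)}})$ if and only if

$$\bigl\|\phi^{(1)}(\mathbb{L}_{D^{-1}},\mathbb{R}_{D^{-1}})^{-1/2}(D^{-1}HD^{-1})\bigr\|_{\mathrm{HS}} = \bigl\|\phi^{(2)}(\mathbb{L}_D,\mathbb{R}_D)^{-1/2}H\bigr\|_{\mathrm{HS}}$$

for all $D \in \mathbb{P}_n$ and $H \in \mathbb{H}_n$. We may assume that $D$ is diagonal. For $D = \mathrm{Diag}(\lambda_1, \ldots, \lambda_n)$
the above equality is written as

$$\left\|\left[\frac{1}{\lambda_i\lambda_j\sqrt{\phi^{(1)}(\lambda_i^{-1},\lambda_j^{-1})}}\right]_{ij} \circ H\right\|_{\mathrm{HS}} = \left\|\left[\frac{1}{\sqrt{\phi^{(2)}(\lambda_i,\lambda_j)}}\right]_{ij} \circ H\right\|_{\mathrm{HS}}.$$

This holds for all $H \in \mathbb{H}_n$ if and only if $\phi^{(1)}(x^{-1},y^{-1})x^2y^2 = \phi^{(2)}(x,y)$, that is,

$$M^{(1)}(x^{-1},y^{-1})^{\theta_1}x^2y^2 = M^{(2)}(x,y)^{\theta_2}, \qquad x, y > 0.$$

Letting $x = y$ implies $x^{4-\theta_1} = x^{\theta_2}$. Hence $\theta_1 + \theta_2 = 4$ must hold, and since
$M^{(1)}(x^{-1},y^{-1}) = M^{(1)}(x,y)/(xy)$ by homogeneity and symmetry, the above condition for
$M^{(1)}$ and $M^{(2)}$ is rewritten as

$$\left(\frac{M^{(1)}(x,y)}{\sqrt{xy}}\right)^{\theta_1} = \left(\frac{M^{(2)}(x,y)}{\sqrt{xy}}\right)^{\theta_2}, \qquad x, y > 0.$$

Since this is obviously satisfied for $\theta_1 = \theta_2 = 2$ and $M^{(1)} = M^{(2)}$, the second assertion follows.
A simple computation with (2.2) gives the last assertion. □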

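Remark 2.4. The latter assertions of Proposition 2.3 can be extended as follows: For every
$\theta, \theta' \in \mathbb{R} \setminus \{2\}$ the Riemannian manifolds $(\mathbb{P}_n, K^{\phi_\theta})$ and $(\mathbb{P}_n, K^{\phi_{\theta'}})$ are isometric under the
diffeomorphism

$$A \mapsto \left|\frac{2-\theta'}{2-\theta}\right|^{\frac{2}{2-\theta'}} A^{\frac{2-\theta}{2-\theta'}},$$

and for every $\alpha \in \mathbb{R} \setminus \{0\}$, $A \mapsto A^\alpha$ is an isometric transformation on $(\mathbb{P}_n, K^{\phi_2})$ up to the
constant factor $|\alpha|$ in the distance.

An interesting problem concerning the family $M_\theta$ is to determine the range of $\theta$ for which
$M_\theta$ is an operator monotone mean, i.e., $M_\theta(x,1)$ is an operator monotone function on $(0,\infty)$.
The cases $\theta = -2, 1, 2$ and $4$ are among typical operator monotone functions as listed in (2.9)-
(2.12). The problem has been settled by Kosaki [20] in such a way that $M_\theta(x,1)$ is operator
monotone if and only if $-2 \le \theta \le 6$.

We give the next lemma on $M_\theta$ for later use.

Lemma 2.5. Let $M_H(x,y) := 2xy/(x+y)$, the harmonic mean. Then $M_{10}(x,1) > M_H(x,1)$
for all $x > 0$ with $x \ne 1$. For every $\theta > 10$, $M_\theta(x,1) < M_H(x,1)$ if $x$ $(\ne 1)$ is sufficiently near
$1$.

Proof. The proof of the first assertion is elementary and omitted. To prove the second, let
$\theta > 10$ and $\alpha := (\theta-2)/2 > 4$. Direct computations show

$$M_\theta(x,1)^{\alpha+1} = \frac{\alpha x^\alpha(x-1)}{x^\alpha-1} = 1 + \frac{\alpha+1}{2}(x-1) + \frac{(\alpha+1)(\alpha-1)}{12}(x-1)^2 + o((x-1)^2),$$

$$M_H(x,1)^{\alpha+1} = \left(\frac{2x}{x+1}\right)^{\alpha+1} = 1 + \frac{\alpha+1}{2}(x-1) + \frac{(\alpha+1)(\alpha-2)}{8}(x-1)^2 + o((x-1)^2),$$

which give the desired assertion, since $(\alpha-1)/12 < (\alpha-2)/8$ exactly when $\alpha > 4$. □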



3 The degree 2 case 

A Riemannian manifold is said to be complete if the distance induced from the Riemannian metric
is complete. It is a general fact in Riemannian geometry that a geodesic shortest curve joining
any two points exists in a complete Riemannian manifold. The next theorem shows that the
Riemannian manifold $(\mathbb{P}_n, K^\phi)$ treated in Section 2 is never complete except in the case of degree
$\theta = 2$.

Theorem 3.1. Let $M \in \mathfrak{M}_0$, $\theta \in \mathbb{R}$ and $\phi(x,y) := M(x,y)^\theta$. Then the Riemannian manifold
$(\mathbb{P}_n, K^\phi)$ is complete if and only if $\theta = 2$. Hence, when $\theta = 2$ (and $M \in \mathfrak{M}_0$ is arbitrary), for
any $A, B \in \mathbb{P}_n$ there is a geodesic shortest curve joining $A, B$ in $(\mathbb{P}_n, K^\phi)$.

Proof. First assume $\theta \ne 2$. The proof of the non-completeness of $(\mathbb{P}_n, K^\phi)$ is easy. Let $\gamma(t) := tI$
for $t > 0$, where $I$ is the $n \times n$ identity matrix. Since

$$\bigl\|\phi(\mathbb{L}_{\gamma(t)},\mathbb{R}_{\gamma(t)})^{-1/2}\gamma'(t)\bigr\|_{\mathrm{HS}} = \|M(t,t)^{-\theta/2}I\|_{\mathrm{HS}} = t^{-\theta/2}n^{1/2},$$

we have

$$\int_0^1 \sqrt{K_{\gamma(t)}^\phi(\gamma'(t),\gamma'(t))}\,dt < +\infty \quad\text{if } \theta < 2,$$

$$\int_1^\infty \sqrt{K_{\gamma(t)}^\phi(\gamma'(t),\gamma'(t))}\,dt < +\infty \quad\text{if } \theta > 2.$$

Hence, if we define $A_k := k^{-1}I$ if $\theta < 2$ and $A_k := kI$ if $\theta > 2$, then it follows that $\{A_k\}_{k=1}^\infty$ is
Cauchy with respect to the geodesic distance $\delta_\phi$. But it is clear that $\{A_k\}$ does not converge
in $(\mathbb{P}_n, K^\phi)$.

Next assume $\theta = 2$, and prove that $(\mathbb{P}_n, K^\phi)$ is complete. To do so, we need a lemma.



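Lemma 3.2. If $M \in \mathfrak{M}_0$ and $\phi(x,y) := M(x,y)^2$, then $\delta_\phi(A,I) = \|\log A\|_{\mathrm{HS}}$ for every $A \in \mathbb{P}_n$.

Proof. We may assume that $A$ is diagonal. Let $\gamma : [0,1] \to \mathbb{P}_n$ be a $C^1$ curve from $A$ to $I$, and
diagonalize $\gamma(t)$, $0 \le t \le 1$, so that

$$\gamma(t) = U(t)\,\mathrm{Diag}(\lambda_1(t), \ldots, \lambda_n(t))\,U(t)^*$$

with $\lambda_1(t) \le \cdots \le \lambda_n(t)$ and unitary matrices $U(t)$. Here one can fix $U(t)$, $0 \le t \le 1$, so
that $\lambda_1(t), \ldots, \lambda_n(t)$ and $U(t)$ are $C^1$ except at branching points of $\lambda_1(t), \ldots, \lambda_n(t)$.
Note that the set of branching points is at most countable. Hence, for each $t$ except
such branching points, we have

$$\gamma'(t) = U(t)\,\mathrm{Diag}(\lambda_1'(t), \ldots, \lambda_n'(t))\,U(t)^* + U'(t)\,\mathrm{Diag}(\lambda_1(t), \ldots, \lambda_n(t))\,U(t)^*
+ U(t)\,\mathrm{Diag}(\lambda_1(t), \ldots, \lambda_n(t))\,U'(t)^*$$

so that

$$U(t)^*\gamma'(t)U(t) = \mathrm{Diag}(\lambda_1'(t), \ldots, \lambda_n'(t)) + U(t)^*U'(t)\,\mathrm{Diag}(\lambda_1(t), \ldots, \lambda_n(t))
+ \mathrm{Diag}(\lambda_1(t), \ldots, \lambda_n(t))\,U'(t)^*U(t).$$

Since $U(t)^*U(t) = I$ yields that $U'(t)^*U(t) + U(t)^*U'(t) = O$, the diagonal entries of $U(t)^*\gamma'(t)U(t)$
are $\lambda_1'(t), \ldots, \lambda_n'(t)$. Hence we get

$$\bigl\|\phi(\mathbb{L}_{\gamma(t)},\mathbb{R}_{\gamma(t)})^{-1/2}\gamma'(t)\bigr\|_{\mathrm{HS}} = \left\|\left[\frac{1}{M(\lambda_i(t),\lambda_j(t))}\right]_{ij} \circ (U(t)^*\gamma'(t)U(t))\right\|_{\mathrm{HS}}
\ge \left(\sum_{i=1}^n \left(\frac{\lambda_i'(t)}{\lambda_i(t)}\right)^2\right)^{1/2}$$

for all $t$ except a countable set. Since $\xi(t) := \mathrm{Diag}(\log\lambda_1(t), \ldots, \log\lambda_n(t))$ is a curve (continuous
in $0 \le t \le 1$ and $C^1$ except on a countable set as mentioned above) from $\log A$ to $O$, we get

$$L_\phi(\gamma) \ge \int_0^1 \|\xi'(t)\|_{\mathrm{HS}}\,dt \ge \|\log A\|_{\mathrm{HS}}.$$

Furthermore, if $A = \mathrm{Diag}(\lambda_1, \ldots, \lambda_n)$ and $\gamma_0(t) := A^{1-t} = \mathrm{Diag}(\lambda_1^{1-t}, \ldots, \lambda_n^{1-t})$ for $0 \le t \le 1$,
then one can easily compute

$$L_\phi(\gamma_0) = \left(\sum_{i=1}^n (\log\lambda_i)^2\right)^{1/2} = \|\log A\|_{\mathrm{HS}},$$

implying $\delta_\phi(A,I) = \|\log A\|_{\mathrm{HS}}$.

□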



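Proof of Theorem 3.1 (continued). Let $\{A_k\}$ be a $\delta_\phi$-Cauchy sequence in $\mathbb{P}_n$. Since $|\delta_\phi(A_k,I) -
\delta_\phi(A_l,I)| \le \delta_\phi(A_k,A_l) \to 0$ as $k, l \to \infty$, it follows from Lemma 3.2 that $\delta_\phi(A_k,I) = \|\log A_k\|_{\mathrm{HS}}$
is a bounded sequence and so $\sup_k \|\log A_k\|_\infty < +\infty$ ($\|\cdot\|_\infty$ being the operator norm). Hence
there is an $\varepsilon > 0$ such that $\varepsilon I \le A_k \le \varepsilon^{-1}I$ for all $k$. By compactness we can choose a
subsequence $\{A_{k_m}\}$ of $\{A_k\}$ such that $\|A_{k_m} - A\|_\infty \to 0$ for some $A \in \mathbb{P}_n$ with $\varepsilon I \le A \le
\varepsilon^{-1}I$. Here we may assume that $\{A_k\}$ itself converges to $A$ in operator norm. Then we have
$\|\log A_k - \log A\|_\infty \to 0$ and so $\|\log A_k - \log A\|_{\mathrm{HS}} \to 0$. Define $\xi_k(t) := (1-t)\log A_k + t\log A$
and $\gamma_k(t) := e^{\xi_k(t)}$ for $0 \le t \le 1$. For each fixed $t \in [0,1]$ diagonalize $\xi_k(t)$ as $\xi_k(t) =
V\,\mathrm{Diag}(\mu_1, \ldots, \mu_n)V^*$ with a unitary $V$. By (2.7) we get

$$\bigl\|M(\mathbb{L}_{\gamma_k(t)},\mathbb{R}_{\gamma_k(t)})^{-1}\gamma_k'(t)\bigr\|_{\mathrm{HS}} = \left\|\left[\frac{M_L(e^{\mu_i},e^{\mu_j})}{M(e^{\mu_i},e^{\mu_j})}\right]_{ij} \circ (V^*\xi_k'(t)V)\right\|_{\mathrm{HS}},$$

where $M_L(x,y)$ is the logarithmic mean. Since $\varepsilon \le e^{\mu_i} \le \varepsilon^{-1}$ for all $1 \le i \le n$, it follows that

$$\frac{M_L(e^{\mu_i},e^{\mu_j})}{M(e^{\mu_i},e^{\mu_j})} \le \varepsilon^{-2} \quad\text{for all } 1 \le i, j \le n.$$

Therefore, for every $0 \le t \le 1$,

$$\bigl\|M(\mathbb{L}_{\gamma_k(t)},\mathbb{R}_{\gamma_k(t)})^{-1}\gamma_k'(t)\bigr\|_{\mathrm{HS}} \le \varepsilon^{-2}\|\xi_k'(t)\|_{\mathrm{HS}} = \varepsilon^{-2}\|\log A_k - \log A\|_{\mathrm{HS}}$$

so that

$$\delta_\phi(A_k,A) \le L_\phi(\gamma_k) \le \varepsilon^{-2}\|\log A_k - \log A\|_{\mathrm{HS}} \to 0 \quad\text{as } k \to \infty.$$

Hence the result follows.

□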



Let $\phi_G(x,y)$ denote the degree 2 power of the geometric mean, i.e., $\phi_G(x,y) := M_G(x,y)^2 =
xy$. The metric $K^{\phi_G}$ induced from $\phi_G$ is the Fisher-Rao metric $g$ mentioned in the Introduction.
The completeness of the Riemannian manifold $(\mathbb{P}_n, K^{\phi_G})$ was shown in [9]. Now we define a
one-parameter family of kernel functions

$$N_\alpha(x,y) := \alpha(xy)^{\alpha/2}\,\frac{x-y}{x^\alpha-y^\alpha}, \qquad x, y > 0,\ \alpha \in \mathbb{R}, \tag{3.1}$$

where $N_0(x,y)$ is understood as

$$N_0(x,y) := \lim_{\alpha \to 0} N_\alpha(x,y) = \frac{x-y}{\log x-\log y} \quad\text{(logarithmic mean)}.$$

We have $N_1(x,y) = \sqrt{xy}$ (geometric mean) and $N_2(x,y) = 2xy/(x+y)$ (harmonic mean).
Note that $N_\alpha(x,y)$ is symmetric and homogeneous in the sense of (1) and (2) at the beginning
of Section 2 and $N_{-\alpha}(x,y) = N_\alpha(x,y)$. When $\alpha > 2$, $N_\alpha$ does not belong to $\mathfrak{M}_0$ since
$N_\alpha(x,1) \to 0$ as $x \to \infty$. When $0 \le \alpha \le 2$, one can easily see by elementary calculus that
$N_\alpha(x,1)$ is increasing in $x > 0$ and $1 \le N_\alpha(x,1) \le x$ for all $x \ge 1$. It is also not difficult to see
that $N_\alpha(x,y)$ is strictly decreasing in $\alpha > 0$ for each $x, y > 0$ with $x \ne y$. Thus $\{N_\alpha\}_{0 \le \alpha \le 2}$ is
a family of means in $\mathfrak{M}_0$ interpolating the logarithmic and the harmonic means.

We determine when our Riemannian metric $K^\phi$ is a pull-back of $K^{\phi_G}$ up to a multiple
constant, and moreover extend (0.4) and (0.5) for $g = K^{\phi_G}$ to the family of metrics induced
from the above $N_\alpha$.
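
The stated properties of the family $N_\alpha$ are easy to probe numerically. The sketch below assumes NumPy and scalar arguments $x \ne y$; the function name `N` and the sample points are illustrative choices only.

```python
import numpy as np

def N(alpha, x, y):
    """N_alpha(x, y) of (3.1) for x != y; alpha = 0 gives the logarithmic mean."""
    if alpha == 0:
        return (x - y) / (np.log(x) - np.log(y))
    return alpha * (x * y) ** (alpha / 2) * (x - y) / (x ** alpha - y ** alpha)

x, y = 3.0, 0.7
for alpha in (0, 0.5, 1, 1.5, 2):
    # decreasing from the logarithmic mean to the harmonic mean
    print(alpha, N(alpha, x, y))
print(N(3, 1e4, 1.0))   # falls below min{x, y} = 1, so N_3 is not a mean
```

The last line illustrates why the range of $\alpha$ has to be restricted: for $\alpha > 2$ the value $N_\alpha(x,1)$ eventually drops below $\min\{x,1\}$.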

Theorem 3.3. Let M € OJIq, 6^ € M and (t){x,y) := M{x,yf . Let a > 0. Assume that F 
is a smooth function from (0,oo) into itself such that F'{x) ^ for all x > 0. Then the 
transformation D G P„ i-H' F{D) E is isometric from {¥n,aK'^) into {¥n,K'^'^) if and only 
if 9 = 2, a <2, F{x) = cx" (up to a constant c > Oj and M = Na- 
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In the above case, for every A,B^ 
n,K'l') from A to B given by 



there exists a unique geodesic shortest curve in 



and moreover 

64A,B) = ||log(A-"/2s"^-"/2)i/"||jjg. 

Proof. For any curve 7 in P„ let ^{t) := F{j{t)). Under the diagonalization 7(t) 
?7Diag(Ai, . . . , Xn)U* for each fixed t £ [0, 1] we have by ([O]) and (l2n) 



6G(iL^(i),M^(,))e' 



HS 



HS 



J 



/TOW 



o(c/V(t)f/)^ 

HS 

o ([/*y(t)^) 



HS 



Hence the isometry property stated in the theorem impHes that 



^F{x)F{y) 



x,y > 0. 



When X = y this yields 



F'{x) 
F{x) 



ax 



-e/2 



X > 0. 



(3.2) 
(3.3) 



Suppose 6^2. Then (13. 3p is solved as 

F{x) = cexp 

with a constant c > 0. By this and (j3.2p . (t){x^ 1) is written as 



2a tdL 
x 2 



(x,l) 



a e 



with r : = 



2a 



If < < 2, then 0(x, 1) ^ as x ^ cxd. But this is inconsistent with 4'{x, 1) = M{x, 1)^ > 1 
for X > 1. If < 0, then x~^(j){x, 1) ^ as x ^ cxd, which is inconsistent with </)(x, 1) = 
M{x, 1)^ > x^ for X > 1. If ^ > 2, then x~^(f>{x, 1) — > as x — > 0, which is also inconsistent 
with (p{x, 1) > x^ for < x < 1. Hence all the cases except = 2 are excluded. When = 2, 
the solution of (|3.3p is -F(x) = ex" with a constant c > 0. This and (|3.2p determine M as 
M = A'^Q,. Then a obeys the restriction a < 2 as shown before the theorem. It is immediate to 
see that the isometry property actually holds if 9, a, F and M are as stated in the theorem. 

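When $\alpha^2 K^\phi$ is a pull-back of $K^{M_G^2}$ as above, a geodesic shortest curve in $(\mathbb{P}_n, K^\phi)$ joining
each $A, B \in \mathbb{P}_n$ is uniquely determined as the image under $A \mapsto A^{1/\alpha}$ of that in $(\mathbb{P}_n, K^{M_G^2})$
joining $A^\alpha, B^\alpha$. Thanks to (0.4) its explicit form is

\[
\gamma(t) := (A^\alpha \#_t B^\alpha)^{1/\alpha}, \qquad 0 \le t \le 1.
\]

Furthermore, thanks to (0.5) it is also immediate to get

\[
\delta_\phi(A,B) = \frac{1}{\alpha}\,\delta_{M_G^2}(A^\alpha, B^\alpha)
= \frac{1}{\alpha}\,\|\log(A^{-\alpha/2} B^\alpha A^{-\alpha/2})\|_{HS}
= \|\log(A^{-\alpha/2} B^\alpha A^{-\alpha/2})^{1/\alpha}\|_{HS},
\]

as required. □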
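The kernel identity behind Theorem 3.3 can be checked numerically in the scalar case. The following is a minimal sketch, assuming that the mean $N_\alpha$ of (3.1), which is not restated in this section, has the closed form $N_\alpha(x,y) = \alpha (xy)^{\alpha/2}(x-y)/(x^\alpha - y^\alpha)$ (this form is consistent with the ratio $N_\beta(e^{2t},1)/N_\alpha(e^{2t},1)$ quoted in Section 5). All function names are illustrative only.

```python
import math

def n_alpha(x, y, alpha):
    """Assumed closed form of the mean N_alpha from (3.1)."""
    if x == y:
        return x
    return alpha * (x * y) ** (alpha / 2) * (x - y) / (x ** alpha - y ** alpha)

def isometry_gap(x, y, alpha):
    """Absolute difference of the two sides of (3.2) for F(x) = x**alpha, phi = N_alpha**2."""
    lhs = alpha / n_alpha(x, y, alpha)                 # alpha / sqrt(phi(x, y))
    divided_diff = (x ** alpha - y ** alpha) / (x - y)  # F^[1](x, y)
    rhs = divided_diff / math.sqrt(x ** alpha * y ** alpha)
    return abs(lhs - rhs)

def log_mean(x, y):
    """Logarithmic mean, the expected limit of N_alpha as alpha -> 0."""
    return (x - y) / (math.log(x) - math.log(y))

print(n_alpha(4.0, 1.0, 1.0))      # geometric mean of 4 and 1
print(n_alpha(4.0, 1.0, 2.0))      # harmonic mean of 4 and 1
print(isometry_gap(4.0, 1.0, 0.7)) # should vanish up to rounding
```

Under this assumption $N_1$ is the geometric mean, $N_2$ the harmonic mean, and $N_\alpha$ approaches the logarithmic mean for small $\alpha$, in line with the remarks following the proof.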
It is desirable to prove the uniqueness of geodesic shortest curves for all metrics treated in
Theorem 3.1 in the degree 2 case.

We write $\psi_\alpha$ for $\phi$ arising in Theorem 3.3, i.e., $\psi_\alpha(x,y) := N_\alpha(x,y)^2$ for $0 < \alpha \le 2$. It is
worth noting that the geodesic shortest path and its distance in $(\mathbb{P}_n, K^{\psi_\alpha})$ converge as $\alpha \searrow 0$
to those in $(\mathbb{P}_n, K^{\phi_L})$ where $\phi_L(x,y) := M_L(x,y)^2$, the degree 2 power of the logarithmic mean.
Namely, we have

\[
\lim_{\alpha \searrow 0} (A^\alpha \#_t B^\alpha)^{1/\alpha} = \exp((1-t)\log A + t\log B), \qquad 0 \le t \le 1,
\]

\[
\lim_{\alpha \searrow 0} \|\log(A^{-\alpha/2} B^\alpha A^{-\alpha/2})^{1/\alpha}\|_{HS} = \|\log A - \log B\|_{HS}
\]

(see the $\theta = 2$ case of Theorem 2.1). In fact, the latter follows from a version of the Lie-Trotter
formula

\[
\lim_{\alpha \to 0} (A^{-\alpha/2} B^\alpha A^{-\alpha/2})^{1/\alpha} = \exp(-\log A + \log B)
\]

and the former is its modification (see [16, Lemma 3.3]). It is also worthwhile to note that
$\|\log(A^{-\alpha/2} B^\alpha A^{-\alpha/2})^{1/\alpha}\|_{HS}$ is increasing in $\alpha > 0$ due to Araki's log-majorization [4] (see also
[2]). Hence $\delta_{\psi_\alpha}(A,B)$ decreases to $\delta_{\phi_L}(A,B)$ as $\alpha \searrow 0$ while $\psi_\alpha(x,y)$ increases to $\phi_L(x,y)$
as $\alpha \searrow 0$. In fact, this kind of comparison property is true in general as we will see in the
following sections.
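The monotonicity in $\alpha$ and the limit $\alpha \searrow 0$ can be observed numerically. The sketch below is only an illustration under simplifying assumptions: it is restricted to real symmetric positive definite $2 \times 2$ matrices, and the helpers `eig_sym2`, `fun`, `dist_psi` and `dist_log` are hypothetical names introduced here, not notation of the paper.

```python
import math

def eig_sym2(m):
    """Eigenpairs of a real symmetric 2x2 matrix [[a, b], [b, c]]."""
    (a, b), (_, c) = m
    mean, half = (a + c) / 2, math.hypot((a - c) / 2, b)
    ang = 0.5 * math.atan2(2 * b, a - c)        # rotation angle of the eigenbasis
    v1 = (math.cos(ang), math.sin(ang))         # eigenvector for mean + half
    v2 = (-math.sin(ang), math.cos(ang))        # eigenvector for mean - half
    return (mean + half, v1), (mean - half, v2)

def fun(m, f):
    """Functional calculus f(m) = sum_k f(lambda_k) v_k v_k^T."""
    out = [[0.0, 0.0], [0.0, 0.0]]
    for lam, v in eig_sym2(m):
        for i in range(2):
            for j in range(2):
                out[i][j] += f(lam) * v[i] * v[j]
    return out

def mul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(2)) for j in range(2)] for i in range(2)]

def hs(m):
    """Hilbert-Schmidt norm."""
    return math.sqrt(sum(x * x for row in m for x in row))

def dist_psi(a, b, alpha):
    """(1/alpha) * ||log(A^{-alpha/2} B^alpha A^{-alpha/2})||_HS, as in Theorem 3.3."""
    a_neg = fun(a, lambda x: x ** (-alpha / 2))
    core = mul(mul(a_neg, fun(b, lambda x: x ** alpha)), a_neg)
    return hs(fun(core, math.log)) / alpha

def dist_log(a, b):
    """||log A - log B||_HS, the limiting distance."""
    la, lb = fun(a, math.log), fun(b, math.log)
    return hs([[la[i][j] - lb[i][j] for j in range(2)] for i in range(2)])

A = [[2.0, 1.0], [1.0, 1.0]]
B = [[1.0, 0.0], [0.0, 3.0]]
for alpha in (1e-3, 0.5, 1.0, 2.0):
    print(alpha, dist_psi(A, B, alpha))
print("limit", dist_log(A, B))
```

For this non-commuting pair the printed values grow with $\alpha$ and the value at $\alpha = 10^{-3}$ is close to $\|\log A - \log B\|_{HS}$.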

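When $A_1, \dots, A_k \in \mathbb{P}_n$, since the arithmetic mean $\frac{1}{k}\sum_{j=1}^k A_j$ is the unique minimizer of
$A \in \mathbb{H}_n \mapsto \sum_{j=1}^k \|A - A_j\|_{HS}^2$, it is immediate from Theorem 2.1 that a certain power mean
$\bigl(\frac{1}{k}\sum_{j=1}^k A_j^{\frac{2-\theta}{2}}\bigr)^{\frac{2}{2-\theta}}$ (understood as $\exp\bigl(\frac{1}{k}\sum_{j=1}^k \log A_j\bigr)$ if $\theta = 2$) is determined as a unique
minimizer of $A \in \mathbb{P}_n \mapsto \sum_{j=1}^k \delta_{\phi_\theta}(A, A_j)^2$. Let $G(A_1, \dots, A_k)$ be the "geometric mean" introduced
in [9, 8], i.e., the unique minimizer of $A \mapsto \sum_{j=1}^k \delta_{M_G^2}(A, A_j)^2$. It is also immediately seen from
Theorem 3.3 that $G(A_1^\alpha, \dots, A_k^\alpha)^{1/\alpha}$ is a unique minimizer of $A \mapsto \sum_{j=1}^k \delta_{\psi_\alpha}(A, A_j)^2$, which is
regarded as a $k$-variable extension of $(A^\alpha \# B^\alpha)^{1/\alpha}$.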

4 Comparison property 

The aim of this section is to compare the geodesic distances for different Riemannian metrics
related to means in $\mathfrak{M}_0$. A general result of this kind is the following:

Theorem 4.1. Let $M^{(1)}, M^{(2)} \in \mathfrak{M}_0$, $\theta_1, \theta_2 \in \mathbb{R}$ and $\phi^{(k)}(x,y) := M^{(k)}(x,y)^{\theta_k}$, $k = 1, 2$. Then
the following conditions are equivalent:

(i) $\phi^{(1)}(x,y) \le \phi^{(2)}(x,y)$ for all $x, y > 0$;

(ii) $\theta_1 = \theta_2 = 0$, or $\theta_1 = \theta_2 > 0$ and $M^{(1)}(x,1) \le M^{(2)}(x,1)$ for all $x > 0$, or $\theta_1 = \theta_2 < 0$
and $M^{(1)}(x,1) \ge M^{(2)}(x,1)$ for all $x > 0$;

(iii) $L_{\phi^{(1)}}(\gamma) \ge L_{\phi^{(2)}}(\gamma)$ for all curves $\gamma$ in $\mathbb{P}_n$;

(iv) $\delta_{\phi^{(1)}}(A,B) \ge \delta_{\phi^{(2)}}(A,B)$ for all $A, B \in \mathbb{P}_n$.
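The pointwise mechanism behind (i) ⟹ (iii) can be illustrated at a diagonal foot point, where the integrand of the length is a weighted Hilbert-Schmidt norm with weights $\phi(\lambda_i,\lambda_j)^{-1}$. The sketch below assumes $\theta_1 = \theta_2 = \theta$ and uses four familiar means; the dictionary `MEANS` and the function `metric_norm` are illustrative names, not taken from the paper.

```python
import math

MEANS = {
    "harmonic":    lambda x, y: 2 * x * y / (x + y),
    "geometric":   lambda x, y: math.sqrt(x * y),
    "logarithmic": lambda x, y: x if x == y else (x - y) / (math.log(x) - math.log(y)),
    "arithmetic":  lambda x, y: (x + y) / 2,
}

def metric_norm(mean, theta, lams, h):
    """||[phi(l_i, l_j)^(-1/2)] o H||_HS for D = Diag(lams) and phi = mean**theta (real H)."""
    total = 0.0
    for i, li in enumerate(lams):
        for j, lj in enumerate(lams):
            total += h[i][j] ** 2 / mean(li, lj) ** theta
    return math.sqrt(total)

lams = [1.0, 2.0, 5.0]
H = [[1.0, 2.0, -1.0], [2.0, 0.5, 3.0], [-1.0, 3.0, 2.0]]
for name, mean in MEANS.items():
    print(name, metric_norm(mean, 1.0, lams, H), metric_norm(mean, -1.0, lams, H))
```

Since the four means are pointwise increasing in the listed order, the norms decrease down the list for $\theta = 1$ and increase for $\theta = -1$, matching the sign cases in (ii).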

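The next lemma is useful to prove the theorem, and it is also meaningful by itself.

Lemma 4.2. Let $M \in \mathfrak{M}_0$, $\theta \in \mathbb{R}$ and $\phi(x,y) := M(x,y)^\theta$. Then for every $D \in \mathbb{P}_n$ and
$H \in \mathbb{H}_n$,

\[
\lim_{\varepsilon \searrow 0} \frac{\delta_\phi(D, D + \varepsilon H)}{\varepsilon} = \|\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_{HS}.
\]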

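Proof. We may assume that $D = \mathrm{Diag}(\lambda_1, \dots, \lambda_n)$. Notice that

\[
\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1} H = \bigl[\phi(\lambda_i, \lambda_j)^{-1}\bigr]_{ij} \circ H, \qquad H \in \mathbb{M}_n,
\]

so that

\[
\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1} \ge \Bigl( \min_{1 \le i \le n} \lambda_i^{-\theta} \Bigr) I \quad \text{on } (\mathbb{M}_n, \langle\cdot,\cdot\rangle_{HS}),
\]

where $I$ is the identity operator on $\mathbb{M}_n$. For each $\rho > 0$ with $\rho < \min_i \lambda_i^{-\theta}$, since $A \in \mathbb{P}_n \mapsto
\phi(\mathbf{L}_A, \mathbf{R}_A)$ is continuous, there exists an $r_1 > 0$ such that if $A \in \mathbb{P}_n$ and $\|A - D\|_{HS} < r_1$ then

\[
\|\phi(\mathbf{L}_A, \mathbf{R}_A)^{-1} - \phi(\mathbf{L}_D, \mathbf{R}_D)^{-1}\|_\infty \le \rho,
\]

where $\|\cdot\|_\infty$ denotes the operator norm for operators on $(\mathbb{M}_n, \langle\cdot,\cdot\rangle_{HS})$. Furthermore, since $\delta_\phi$
and $\|\cdot\|_{HS}$ define the same topology on $\mathbb{P}_n$ (see [18, Chapter IV, Proposition 3.5]), there exists
an $r_0 > 0$ such that if $A \in \mathbb{P}_n$ and $\delta_\phi(A, D) < r_0$ then $\|A - D\|_{HS} < r_1$.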

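Now let $H \in \mathbb{H}_n$ and $\varepsilon > 0$ be sufficiently small so that $\delta_\phi(D, D + \varepsilon H) < r_0$ and $\varepsilon\|H\|_{HS} <
r_1$. Let $\gamma : [0,1] \to \mathbb{P}_n$ be any curve from $D$ to $D + \varepsilon H$ such that $L_\phi(\gamma) < r_0$. Since
$\delta_\phi(\gamma(t), D) < r_0$ and so $\|\gamma(t) - D\|_{HS} < r_1$ for all $0 \le t \le 1$, we get

\[
\begin{aligned}
L_\phi(\gamma) &= \int_0^1 \bigl\langle \gamma'(t), \phi(\mathbf{L}_{\gamma(t)}, \mathbf{R}_{\gamma(t)})^{-1} \gamma'(t) \bigr\rangle_{HS}^{1/2}\, dt \\
&\ge \int_0^1 \bigl\langle \gamma'(t), \bigl(\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1} - \rho I\bigr) \gamma'(t) \bigr\rangle_{HS}^{1/2}\, dt \\
&\ge \bigl\| \bigl(\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1} - \rho I\bigr)^{1/2} (\varepsilon H) \bigr\|_{HS} \\
&= \varepsilon\, \bigl\| \bigl[ (\phi(\lambda_i, \lambda_j)^{-1} - \rho)^{1/2} \bigr]_{ij} \circ H \bigr\|_{HS}.
\end{aligned}
\]

In the above, note that $\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1} - \rho I > 0$ on the Hilbert space $(\mathbb{M}_n, \langle\cdot,\cdot\rangle_{HS})$ since

\[
\rho < \min_i \lambda_i^{-\theta} = \min_{i,j} \phi(\lambda_i, \lambda_j)^{-1}.
\]

Also, the second inequality above follows since $\int_0^1 \|(\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1} - \rho I)^{1/2} \gamma'(t)\|_{HS}\, dt$ is the
length of the curve $(\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1} - \rho I)^{1/2}\gamma$ in the Euclidean space $(\mathbb{M}_n, \|\cdot\|_{HS})$ and it is shortest if $\gamma$ is the segment between $D$ and
$D + \varepsilon H$. Taking the infimum of $L_\phi(\gamma)$ gives

\[
\delta_\phi(D, D + \varepsilon H) \ge \varepsilon\, \bigl\| \bigl[ (\phi(\lambda_i, \lambda_j)^{-1} - \rho)^{1/2} \bigr]_{ij} \circ H \bigr\|_{HS}.
\]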

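On the other hand, let $\gamma_0(t) := D + t\varepsilon H$. Since $\|\gamma_0(t) - D\|_{HS} \le \varepsilon\|H\|_{HS} < r_1$ for $0 \le t \le 1$,
we get

\[
\delta_\phi(D, D + \varepsilon H) \le L_\phi(\gamma_0) \le \varepsilon\, \bigl\| \bigl[ (\phi(\lambda_i, \lambda_j)^{-1} + \rho)^{1/2} \bigr]_{ij} \circ H \bigr\|_{HS}.
\]

Since $\rho$ is arbitrary,

\[
\lim_{\varepsilon \searrow 0} \frac{\delta_\phi(D, D + \varepsilon H)}{\varepsilon}
= \bigl\| \bigl[ \phi(\lambda_i, \lambda_j)^{-1/2} \bigr]_{ij} \circ H \bigr\|_{HS}
= \|\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_{HS}.
\]

□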



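Proof of Theorem 4.1. First, (i) ⟺ (ii) is easy to check. To prove (i) ⟹ (iii), it suffices to show
that (i) implies that

\[
\|\phi^{(1)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_{HS} \ge \|\phi^{(2)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_{HS} \tag{4.1}
\]

for all $D \in \mathbb{P}_n$ and $H \in \mathbb{H}_n$. But this implication is immediately seen thanks to (1.2). (iii)
⟹ (iv) is obvious. Finally, assume (iv) and apply Lemma 4.2 to get (4.1) for all $D \in \mathbb{P}_n$ and
$H \in \mathbb{H}_n$. When

\[
D := \begin{bmatrix} x & 0 \\ 0 & y \end{bmatrix} \oplus I_{n-2} \ \text{ with } x, y > 0
\qquad \text{and} \qquad
H := \begin{bmatrix} 1 & 1 \\ 1 & 1 \end{bmatrix} \oplus O_{n-2},
\]

(4.1) means that

\[
\sqrt{x^{-\theta_1} + 2\phi^{(1)}(x,y)^{-1} + y^{-\theta_1}} \ge \sqrt{x^{-\theta_2} + 2\phi^{(2)}(x,y)^{-1} + y^{-\theta_2}},
\]

which gives (i). □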



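Remark 4.3. Let $\mathcal{D}_n := \{D \in \mathbb{P}_n : \mathrm{Tr}\, D = 1\}$, a submanifold of $\mathbb{P}_n$. One can replace $(\mathbb{P}_n, \mathbb{H}_n)$
by $(\mathcal{D}_n, \mathbb{H}_n \ominus \mathbb{R}I)$ and slightly modify the above proof to show that the above (i)-(iv) are also
equivalent to the following conditions reduced on $\mathcal{D}_n$:

(iii') $L_{\phi^{(1)}}(\gamma) \ge L_{\phi^{(2)}}(\gamma)$ for all curves $\gamma$ in $\mathcal{D}_n$;

(iv') $\delta'_{\phi^{(1)}}(A,B) \ge \delta'_{\phi^{(2)}}(A,B)$ for all $A, B \in \mathcal{D}_n$, where $\delta'_\phi(A,B)$ denotes the geodesic distance
in the Riemannian manifold $(\mathcal{D}_n, K^\phi)$.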

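By Theorems 2.1 and 4.1 we have:

Corollary 4.4. Let $M \in \mathfrak{M}_0$, $\theta \in \mathbb{R}$ and $\phi(x,y) := M(x,y)^\theta$. If $\phi(x,y) \le \phi_\theta(x,y)$ for all
$x, y > 0$ (see Section 2 for $\phi_\theta$), then for every $A, B \in \mathbb{P}_n$,

\[
\delta_\phi(A,B) \ge \delta_{\phi_\theta}(A,B) =
\begin{cases}
\dfrac{2}{|2-\theta|}\, \bigl\| A^{\frac{2-\theta}{2}} - B^{\frac{2-\theta}{2}} \bigr\|_{HS} & \text{if } \theta \ne 2, \\[2mm]
\|\log A - \log B\|_{HS} & \text{if } \theta = 2.
\end{cases}
\tag{4.2}
\]

If $\phi(x,y) \ge \phi_\theta(x,y)$ for all $x, y > 0$, then the reversed inequality holds in (4.2).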
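The right-hand side of (4.2) is easy to evaluate when $A$ and $B$ are diagonal. The following minimal sketch (the function name `dist_phi_theta` is hypothetical, and restricting to diagonal matrices is a simplifying assumption) also shows that the two branches of (4.2) fit together continuously at $\theta = 2$.

```python
import math

def dist_phi_theta(a_diag, b_diag, theta):
    """Right-hand side of (4.2) for A = Diag(a_diag), B = Diag(b_diag)."""
    if theta == 2:
        sq = sum((math.log(a) - math.log(b)) ** 2 for a, b in zip(a_diag, b_diag))
        return math.sqrt(sq)
    s = (2 - theta) / 2                      # exponent (2 - theta)/2
    sq = sum((a ** s - b ** s) ** 2 for a, b in zip(a_diag, b_diag))
    return math.sqrt(sq) / abs(s)            # 2/|2 - theta| equals 1/|s|

a, b = [1.0, 4.0], [2.0, 1.0]
for theta in (0, 1, 1.999999, 2, 2.000001, 4):
    print(theta, dist_phi_theta(a, b, theta))
```

For $\theta = 0$ this is the Euclidean distance $\|A - B\|_{HS}$ and for $\theta = 1$ it is $2\|A^{1/2} - B^{1/2}\|_{HS}$.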
The next theorem is a refinement of Corollary 4.4 with strict inequality under additional
assumptions.

Theorem 4.5. Let $M$, $\theta$ and $\phi$ be as in Corollary 4.4. Assume that $A, B \in \mathbb{P}_n$ are not
commuting, i.e., $AB \ne BA$. If $\phi(x,y) < \phi_\theta(x,y)$ for all $x, y > 0$ with $x \ne y$, then $\delta_\phi(A,B) >
\delta_{\phi_\theta}(A,B)$. Similarly, $\delta_\phi(A,B) < \delta_{\phi_\theta}(A,B)$ if $\phi(x,y) > \phi_\theta(x,y)$ for all $x, y > 0$ with $x \ne y$.

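To prove the theorem, we need a simple lemma.

Lemma 4.6. Let $\phi^{(k)}$, $k = 1, 2$, be as in Theorem 4.1, and assume that $\phi^{(1)}(x,y) < \phi^{(2)}(x,y)$
for all $x, y > 0$ with $x \ne y$. If $\gamma : [0,1] \to \mathbb{P}_n$ is a curve and $\gamma(t)\gamma'(t) \ne \gamma'(t)\gamma(t)$ for some
$t \in [0,1]$, then $L_{\phi^{(1)}}(\gamma) > L_{\phi^{(2)}}(\gamma)$.

Proof. It suffices to show that if $D \in \mathbb{P}_n$ and $H \in \mathbb{H}_n$ are not commuting, then

\[
\|\phi^{(1)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_{HS} > \|\phi^{(2)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_{HS}.
\]

To prove this, we may assume that $D = \mathrm{Diag}(\lambda_1, \dots, \lambda_n)$. Then $DH \ne HD$ means that
$H_{ij} \ne 0$ for some $(i,j)$ with $\lambda_i \ne \lambda_j$, where $H = [H_{ij}]$. Since $\phi^{(1)}(\lambda_i, \lambda_j) < \phi^{(2)}(\lambda_i, \lambda_j)$ for such
$(i,j)$, we obviously get

\[
\bigl\| \bigl[\phi^{(1)}(\lambda_i,\lambda_j)^{-1/2}\bigr]_{ij} \circ H \bigr\|_{HS} > \bigl\| \bigl[\phi^{(2)}(\lambda_i,\lambda_j)^{-1/2}\bigr]_{ij} \circ H \bigr\|_{HS}
\]

as required. □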

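Proof of Theorem 4.5. Assume that $\phi(x,y) < \phi_\theta(x,y)$ for all $x \ne y$ and on the contrary
that $\delta_\phi(A,B) = \delta_{\phi_\theta}(A,B)$. Choose a sequence $\{\gamma_k\}$ of curves from $A$ to $B$ such that
$L_\phi(\gamma_k) \to \delta_{\phi_\theta}(A,B)$ as $k \to \infty$. The following proof is given in the case $\theta \ne 2$ but the case
$\theta = 2$ is similar with obvious modifications. Let $\xi_k(t) := \gamma_k(t)^{\frac{2-\theta}{2}}$ for $0 \le t \le 1$. Theorem
4.1 gives

\[
\delta_{\phi_\theta}(A,B) \le L_{\phi_\theta}(\gamma_k) \le L_\phi(\gamma_k) \longrightarrow \delta_{\phi_\theta}(A,B)
\]

so that by Theorem 2.1

\[
\frac{|2-\theta|}{2}\, L_{\phi_\theta}(\gamma_k) = \int_0^1 \|\xi_k'(t)\|_{HS}\, dt \longrightarrow \bigl\| A^{\frac{2-\theta}{2}} - B^{\frac{2-\theta}{2}} \bigr\|_{HS} \quad \text{as } k \to \infty.
\]

By reparametrizing $\xi_k(t)$'s (hence $\gamma_k(t)$'s) one may assume that each $\xi_k$ has a constant speed,
i.e.,

\[
\|\xi_k'(t)\|_{HS} = \frac{|2-\theta|}{2}\, L_{\phi_\theta}(\gamma_k), \qquad 0 \le t \le 1.
\]

Set $\alpha := \|A^{\frac{2-\theta}{2}} - B^{\frac{2-\theta}{2}}\|_{HS}$ and $H_0 := \alpha^{-1}(B^{\frac{2-\theta}{2}} - A^{\frac{2-\theta}{2}})$, a unit vector in $(\mathbb{H}_n, \langle\cdot,\cdot\rangle_{HS})$. We
notice

\[
\int_0^1 \Bigl( 1 - \Bigl\langle \frac{\xi_k'(t)}{\|\xi_k'(t)\|_{HS}}, H_0 \Bigr\rangle_{HS} \Bigr) dt
= 1 - \frac{2}{|2-\theta|\, L_{\phi_\theta}(\gamma_k)} \bigl\langle B^{\frac{2-\theta}{2}} - A^{\frac{2-\theta}{2}}, H_0 \bigr\rangle_{HS}
= 1 - \frac{2\alpha}{|2-\theta|\, L_{\phi_\theta}(\gamma_k)} \longrightarrow 0
\]

as $k \to \infty$. Hence, by taking a subsequence, one can assume that

\[
\Bigl\langle \frac{\xi_k'(t)}{\|\xi_k'(t)\|_{HS}}, H_0 \Bigr\rangle_{HS} \longrightarrow 1 \quad \text{a.e. } t \in [0,1].
\]

Since $\|\xi_k'(t)\|_{HS} = 2^{-1}|2-\theta|\, L_{\phi_\theta}(\gamma_k) \to \alpha$, this means that

\[
\bigl\| \xi_k'(t) - (B^{\frac{2-\theta}{2}} - A^{\frac{2-\theta}{2}}) \bigr\|_{HS} \longrightarrow 0 \quad \text{a.e. } t \in [0,1], \tag{4.3}
\]

which implies also that for every $0 \le t \le 1$

\[
\bigl\| \xi_k(t) - \bigl((1-t)A^{\frac{2-\theta}{2}} + tB^{\frac{2-\theta}{2}}\bigr) \bigr\|_{HS}
= \Bigl\| \int_0^t \bigl( \xi_k'(s) - (B^{\frac{2-\theta}{2}} - A^{\frac{2-\theta}{2}}) \bigr) ds \Bigr\|_{HS}
\le \int_0^t \bigl\| \xi_k'(s) - (B^{\frac{2-\theta}{2}} - A^{\frac{2-\theta}{2}}) \bigr\|_{HS}\, ds \longrightarrow 0. \tag{4.4}
\]

Now define $\xi_0(t) := (1-t)A^{\frac{2-\theta}{2}} + tB^{\frac{2-\theta}{2}}$ and $\gamma_0(t) := \xi_0(t)^{\frac{2}{2-\theta}}$. With $G_\theta(x) := x^{\frac{2}{2-\theta}}$ and its
first divided difference $G_\theta^{[1]}$, one can apply (2.6), (4.3) and (4.4) to obtain

\[
\begin{aligned}
\|\phi(\mathbf{L}_{\gamma_0(t)}, \mathbf{R}_{\gamma_0(t)})^{-1/2} \gamma_0'(t)\|_{HS}
&= \|\phi(\mathbf{L}_{G_\theta(\xi_0(t))}, \mathbf{R}_{G_\theta(\xi_0(t))})^{-1/2}\, G_\theta^{[1]}(\mathbf{L}_{\xi_0(t)}, \mathbf{R}_{\xi_0(t)})\, \xi_0'(t)\|_{HS} \\
&= \lim_{k \to \infty} \|\phi(\mathbf{L}_{G_\theta(\xi_k(t))}, \mathbf{R}_{G_\theta(\xi_k(t))})^{-1/2}\, G_\theta^{[1]}(\mathbf{L}_{\xi_k(t)}, \mathbf{R}_{\xi_k(t)})\, \xi_k'(t)\|_{HS} \\
&= \lim_{k \to \infty} \|\phi(\mathbf{L}_{\gamma_k(t)}, \mathbf{R}_{\gamma_k(t)})^{-1/2} \gamma_k'(t)\|_{HS}
\end{aligned}
\]

for a.e. $t \in [0,1]$. Fatou's lemma gives

\[
L_\phi(\gamma_0) \le \liminf_{k \to \infty} L_\phi(\gamma_k) = \delta_{\phi_\theta}(A,B) = L_{\phi_\theta}(\gamma_0) \tag{4.5}
\]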

thanks to Theorem 2.1. Here it is clear that $\xi_0(t)$ and $\xi_0'(t)$ are not commuting for any $0 \le t \le 1$.
Hence $\gamma_0(t)$ and $\gamma_0'(t)$ never commute for $0 \le t \le 1$. In fact, this is seen because $\xi_0'(t)$ can be
approximated by polynomials of $\gamma_0(t)$ and $\gamma_0'(t)$ thanks to (2.6) applied to $\xi_0(t) = G_\theta^{-1}(\gamma_0(t))$
so that $\gamma_0(t)\gamma_0'(t) = \gamma_0'(t)\gamma_0(t)$ implies $\xi_0(t)\xi_0'(t) = \xi_0'(t)\xi_0(t)$. Hence (4.5) contradicts the
conclusion of Lemma 4.6.

The proof of the second assertion is easy. Assume that $\phi(x,y) > \phi_\theta(x,y)$ for all $x \ne y$, and
let $\gamma_0(t)$ be the same as in the proof of the first assertion. Since $\gamma_0(t)$ and $\gamma_0'(t)$ never commute for
$0 \le t \le 1$ as mentioned above, Lemma 4.6 again implies that

\[
\delta_\phi(A,B) \le L_\phi(\gamma_0) < L_{\phi_\theta}(\gamma_0) = \delta_{\phi_\theta}(A,B),
\]

as required. □



The above proof of the first assertion is a bit involved. The proof would be much simpler if a
geodesic shortest path joining $A$ and $B$ exists in $(\mathbb{P}_n, K^\phi)$, which is not known at the moment.

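Example 4.7. The following are examples of the inequality given in Corollary 4.4 in the cases
of familiar means. In fact, these are immediate consequences of Corollary 4.4 and Lemma
2.2 together with (2.9)-(2.12) and Lemma 2.5. Furthermore, Theorem 4.5 shows that all
inequalities in the following become strict if $A, B$ are not commuting and the respective closed
range of $\theta$ is replaced by the open range.

(1) For the $\theta$-power $M_A^\theta(x,y) = \bigl(\frac{x+y}{2}\bigr)^\theta$ of the arithmetic mean,

\[
\delta_{M_A^\theta}(A,B)
\begin{cases}
\le \delta_{\phi_\theta}(A,B) & \text{if } \theta \le -2,\ \theta \ge 0, \\
\ge \delta_{\phi_\theta}(A,B) & \text{if } -2 \le \theta \le 0.
\end{cases}
\]

(2) For the $\theta$-power $M_{\surd}^\theta(x,y) = \bigl(\frac{\sqrt{x}+\sqrt{y}}{2}\bigr)^{2\theta}$ of the root mean,

\[
\delta_{M_{\surd}^\theta}(A,B)
\begin{cases}
\le \delta_{\phi_\theta}(A,B) & \text{if } \theta \le 0,\ \theta \ge 1, \\
\ge \delta_{\phi_\theta}(A,B) & \text{if } 0 \le \theta \le 1.
\end{cases}
\]

(3) For the $\theta$-power $M_L^\theta(x,y) = \bigl(\frac{x-y}{\log x - \log y}\bigr)^\theta$ of the logarithmic mean,

\[
\delta_{M_L^\theta}(A,B)
\begin{cases}
\le \delta_{\phi_\theta}(A,B) & \text{if } \theta \le 0,\ \theta \ge 2, \\
\ge \delta_{\phi_\theta}(A,B) & \text{if } 0 \le \theta \le 2.
\end{cases}
\]

(4) For the $\theta$-power $M_G^\theta(x,y) = (xy)^{\theta/2}$ of the geometric mean,

\[
\delta_{M_G^\theta}(A,B)
\begin{cases}
\le \delta_{\phi_\theta}(A,B) & \text{if } \theta \le 0,\ \theta \ge 4, \\
\ge \delta_{\phi_\theta}(A,B) & \text{if } 0 \le \theta \le 4.
\end{cases}
\]

(5) For the $\theta$-power $M_H^\theta(x,y) = \bigl(\frac{2xy}{x+y}\bigr)^\theta$ of the harmonic mean,

\[
\delta_{M_H^\theta}(A,B)
\begin{cases}
\le \delta_{\phi_\theta}(A,B) & \text{if } \theta \le 0, \\
\ge \delta_{\phi_\theta}(A,B) & \text{if } 0 \le \theta \le 10.
\end{cases}
\]

For any $\theta \in \mathbb{R}$, $M_H(x,1) < M_\theta(x,1)$ holds for large $x > 0$ since $\lim_{x\to\infty} M_\theta(x,1) = +\infty$
while $\lim_{x\to\infty} M_H(x,1) = 2$. From this and Lemma 2.5 we observe that $\delta_{M_H^\theta}(A,B)$ and
$\delta_{\phi_\theta}(A,B)$ are not comparable when $\theta > 10$.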
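The threshold values of $\theta$ in (1), (2) and (4) can be cross-checked in the scalar case. The sketch below assumes the explicit form $\phi_\theta(x,y) = \bigl(\frac{2-\theta}{2}\cdot\frac{x-y}{x^{(2-\theta)/2}-y^{(2-\theta)/2}}\bigr)^2$, which is what the distance formula (4.2) suggests but which is defined in Section 2 and not restated here; `m_theta` (for $M_\theta = \phi_\theta^{1/\theta}$) and `harmonic` are illustrative names.

```python
import math

def m_theta(x, y, theta):
    """Mean M_theta with M_theta**theta = phi_theta (assumed form; theta not in {0, 2})."""
    if x == y:
        return x
    s = (2 - theta) / 2
    phi = (s * (x - y) / (x ** s - y ** s)) ** 2
    return phi ** (1 / theta)

def harmonic(x, y):
    return 2 * x * y / (x + y)

x, y = 4.0, 1.0
print(m_theta(x, y, -2), (x + y) / 2)                               # arithmetic mean
print(m_theta(x, y, 1), ((math.sqrt(x) + math.sqrt(y)) / 2) ** 2)   # root mean
print(m_theta(x, y, 4), math.sqrt(x * y))                           # geometric mean
print(m_theta(x, y, 2 + 1e-6), (x - y) / math.log(x / y))           # near the logarithmic mean

# For theta = 11 the harmonic mean is larger near x = 1 but smaller for large x.
print(harmonic(1.1, 1.0) - m_theta(1.1, 1.0, 11))
print(harmonic(100.0, 1.0) - m_theta(100.0, 1.0, 11))
```

Under this assumption $M_{-2} = M_A$, $M_1 = M_{\surd}$ and $M_4 = M_G$, while $M_\theta$ approaches $M_L$ as $\theta \to 2$, which is where the equality cases of (1)-(4) occur; the last two lines illustrate the non-comparability for a value $\theta > 10$.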

In the case $\theta = 2$ the above example (4) with (0.5) says that

\[
\|\log(A^{-1/2} B A^{-1/2})\|_{HS} \ge \|\log A - \log B\|_{HS}, \qquad A, B \in \mathbb{P}_n.
\]

This is the so-called exponential metric increasing (EMI) property in [7, 9]. On the other hand,
for instance, (1) says that

\[
\delta_{M_A^2}(A,B) \le \|\log A - \log B\|_{HS}, \qquad A, B \in \mathbb{P}_n,
\]

which may be called the "exponential metric decreasing" property. In the case $\theta = 1$ the above
examples give

\[
\delta_{M_H}(A,B) \ge \delta_{M_G}(A,B) \ge \delta_{M_L}(A,B) \ge 2\|A^{1/2} - B^{1/2}\|_{HS} \ge \delta_{M_A}(A,B),
\]

which may be called the "square metric increasing/decreasing" properties.

In the particular case where $\phi(x,y) = M(x,y)$ (of degree $\theta = 1$) is an operator monotone
mean, i.e., $M(x,1)$ is a standard operator monotone function and moreover $A, B$ are commuting,
the next theorem gives the exact formula for $\delta_M(A,B)$ independently of the choice of $M$.
It seems that this independence of $M$ is reflected by the uniqueness of a monotone Riemannian
metric in the classical case (see [30]).






Theorem 4.8. Let $M \in \mathfrak{M}_0$ and assume that $M(x,1)$ is an operator monotone function. If
$A, B \in \mathbb{P}_n$ are commuting, then

\[
\delta_M(A,B) = 2\|A^{1/2} - B^{1/2}\|_{HS},
\]

and a geodesic shortest curve from $A$ to $B$ is given by

\[
\gamma_{A,B}(t) := ((1-t)A^{1/2} + tB^{1/2})^2, \qquad 0 \le t \le 1,
\]

independently of the choice of $M$ as above. Furthermore, this $\gamma_{A,B}$ is a unique geodesic shortest
curve from $A$ to $B$ whenever $M \ne M_A$.

First we give a small lemma. 

Lemma 4.9. Assume that $\gamma : [0,1] \to \mathbb{P}_n$ is a curve and $\gamma(t)\gamma'(t) = \gamma'(t)\gamma(t)$ for all
$t \in [0,1]$. Let $\xi(t) := \gamma(t)^{1/2}$. Then $L_M(\gamma) = 2\int_0^1 \|\xi'(t)\|_{HS}\, dt$ for all $M$ as stated in Theorem
4.8 (i.e., $M \in \mathfrak{M}_0$ with operator monotone $M(x,1)$).

Proof. Since $M(x,x) = x$ for all $x > 0$, we note that $\|M(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_{HS}$ is independent of
the choice of $M$ whenever $D \in \mathbb{P}_n$ and $H \in \mathbb{H}_n$ are commuting. This implies that $L_M(\gamma)$ is
independent of $M$ if $\gamma$ is as stated in the lemma. Hence the lemma follows by the $\theta = 1$ case
of Theorem 2.1. □

Proof of Theorem 4.8. Assume that $AB = BA$, and let $\gamma_{A,B}$ be as given in the theorem. By
Lemma 4.9 and Theorem 2.1 we have

\[
L_M(\gamma_{A,B}) = L_{M_{\surd}}(\gamma_{A,B}) = 2\|A^{1/2} - B^{1/2}\|_{HS}
\]

so that $\delta_M(A,B) \le 2\|A^{1/2} - B^{1/2}\|_{HS}$. To prove the converse, let $\Phi$ denote the conditional
expectation (with respect to $\mathrm{Tr}$) of $\mathbb{M}_n$ onto the commutative subalgebra generated by $A, B$,
and let $\gamma : [0,1] \to \mathbb{P}_n$ be an arbitrary curve from $A$ to $B$. Then $\Phi(\gamma)$ is a curve in $\mathbb{P}_n$
from $A$ to $B$. Since $K^M$ is a monotone metric [30] (see also Introduction), we get

\[
K^M_{\Phi(\gamma(t))}(\Phi(\gamma'(t)), \Phi(\gamma'(t))) \le K^M_{\gamma(t)}(\gamma'(t), \gamma'(t)), \qquad 0 \le t \le 1,
\]

so that $L_M(\Phi(\gamma)) \le L_M(\gamma)$. Hence we may assume that $\gamma(t)$'s are in a commutative subalgebra.
When $\xi(t) := \gamma(t)^{1/2}$, we get by Lemma 4.9

\[
L_M(\gamma) = 2\int_0^1 \|\xi'(t)\|_{HS}\, dt \ge 2\|A^{1/2} - B^{1/2}\|_{HS}.
\]

Hence $\delta_M(A,B) = 2\|A^{1/2} - B^{1/2}\|_{HS}$ and $\gamma_{A,B}$ is a common geodesic shortest curve from $A$ to
$B$ for all metrics $K^M$ with operator monotone $M$.

Next we show the last assertion on the uniqueness of a geodesic curve. To prove this, let
$\gamma_1 : [0,1] \to \mathbb{P}_n$ be a curve from $A$ to $B$ such that $L_M(\gamma_1) = 2\|A^{1/2} - B^{1/2}\|_{HS}$. Since $M_A(x,1)$ is
the largest standard operator monotone function and $M \ne M_A$, note that $M(x,1) < M_A(x,1)$
for all $x > 0$ with $x \ne 1$. Since $L_M(\gamma_1) \ge L_{M_A}(\gamma_1)$ by Theorem 4.1, it follows from Lemma 4.6
that $\gamma_1(t)\gamma_1'(t) = \gamma_1'(t)\gamma_1(t)$ for all $t \in [0,1]$. Lemma 4.9 in turn implies that $\int_0^1 \|\xi_1'(t)\|_{HS}\, dt =
\|A^{1/2} - B^{1/2}\|_{HS}$, where $\xi_1(t) := \gamma_1(t)^{1/2}$. Therefore we get $\xi_1(t) = (1-t)A^{1/2} + tB^{1/2}$, $0 \le t \le 1$,
so that $\gamma_1 = \gamma_{A,B}$. □
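For diagonal (hence commuting) $A$ and $B$ the statement of Theorem 4.8 can be observed with a simple quadrature. The sketch below is an illustration under the assumption that the curves stay diagonal, so that only $M(x,x) = x$ enters the metric; `length_commuting`, `sqrt_geodesic` and `segment` are hypothetical helper names.

```python
import math

def length_commuting(curve, n_steps=2000):
    """Midpoint-rule value of int_0^1 sqrt(sum_i g_i'(t)^2 / g_i(t)) dt for a diagonal curve.

    Only M(x, x) = x is used, so the result is the same for every mean M.
    """
    h = 1.0 / n_steps
    total = 0.0
    for k in range(n_steps):
        t = (k + 0.5) * h
        left, right, mid = curve(t - h / 2), curve(t + h / 2), curve(t)
        speed_sq = sum(((r - l) / h) ** 2 / m for l, r, m in zip(left, right, mid))
        total += math.sqrt(speed_sq) * h
    return total

def sqrt_geodesic(a, b):
    """t -> ((1 - t) a^(1/2) + t b^(1/2))^2 entrywise, the curve gamma_{A,B} of Theorem 4.8."""
    return lambda t: [((1 - t) * math.sqrt(x) + t * math.sqrt(y)) ** 2 for x, y in zip(a, b)]

def segment(a, b):
    """Straight segment t -> (1 - t) a + t b, used for comparison."""
    return lambda t: [(1 - t) * x + t * y for x, y in zip(a, b)]

a, b = [1.0, 4.0], [4.0, 1.0]
target = 2 * math.sqrt(sum((math.sqrt(x) - math.sqrt(y)) ** 2 for x, y in zip(a, b)))
print(target, length_commuting(sqrt_geodesic(a, b)), length_commuting(segment(a, b)))
```

The length of $\gamma_{A,B}$ reproduces $2\|A^{1/2} - B^{1/2}\|_{HS}$, while the straight segment between the same endpoints is longer.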
When $M = M_A$ and $A, B$ are commuting, it is not known whether $\gamma_{A,B}$ is a unique geodesic
shortest path joining $A, B$. To prove this, we probably need to examine the equality case in the
monotonicity of $K^M_D(H, H)$ under conditional expectation. Another problem for commuting
$A, B$ is whether $\gamma_{A,B}$ gives a geodesic shortest path for any metric $K^M$ with $M \in \mathfrak{M}_0$ which is
not necessarily operator monotone.

We close the section with a remark on comparison of skew informations given in (0.9). Let
$f$ and $g$ be two standard operator monotone functions that are regular, i.e., $f(0), g(0) > 0$.
It is immediate to see that $I_D^f(K) \ge I_D^g(K)$ for all $D \in \mathbb{P}_n$ and $K \in \mathbb{H}_n$ if and only if
$f(0)/f(x) \ge g(0)/g(x)$ for all $x > 0$. For example, as for $f_p = f_{1-p}$, $0 < p \le 1/2$, given in (0.8),
$f_p(0)/f_p(x)$ is increasing in $p \in (0, 1/2]$ so that the Wigner-Yanase-Dyson skew information
$I_p^{\mathrm{WYD}}(D, K)$ is increasing in $p \in (0, 1/2]$ for fixed $D$ and $K$ (see [5]).



5 Unitarily invariant norms 

Let $|||\cdot|||$ be a unitarily invariant norm on matrices, that is, $|||\cdot|||$ is a norm on $\mathbb{M}_n$, $n \in \mathbb{N}$, such
that $|||UXV||| = |||X|||$ for all $X, U, V \in \mathbb{M}_n$ with $U, V$ unitaries. The Hilbert-Schmidt norm
$\|\cdot\|_{HS}$ is a special example of such norms. When a kernel function $\phi : (0,\infty) \times (0,\infty) \to (0,\infty)$
is given, replacing $\|\cdot\|_{HS}$ by $|||\cdot|||$ in (1.3) we define the length

\[
L_{\phi,|||\cdot|||}(\gamma) := \int_0^1 |||\phi(\mathbf{L}_{\gamma(t)}, \mathbf{R}_{\gamma(t)})^{-1/2} \gamma'(t)|||\, dt
\]

of a curve $\gamma : [0,1] \to \mathbb{P}_n$. The distance $\delta_{\phi,|||\cdot|||}(A,B)$ between $A, B \in \mathbb{P}_n$ is the infimum
of $L_{\phi,|||\cdot|||}(\gamma)$ over all curves $\gamma$ from $A$ to $B$. The manifold $\mathbb{P}_n$ with the distance $\delta_{\phi,|||\cdot|||}$ is
no longer a Riemannian manifold but a certain Finsler manifold. When $|||\cdot|||$ is the operator
norm, such Finsler manifolds have been studied by several authors (see [11] for example).

In this section we show that many results in the previous sections hold true even when the
Hilbert-Schmidt norm $\|\cdot\|_{HS}$ is replaced by a general unitarily invariant norm $|||\cdot|||$. First,
Theorem 2.1 can be extended as follows. We omit the proof, which is essentially the same as the
second proof of Theorem 2.1.

Proposition 5.1. Let $|||\cdot|||$ be any unitarily invariant norm. Let $M$, $\theta$, $\phi$ and $F$ be as in
Theorem 2.1. Then the transformation $D \in \mathbb{P}_n \mapsto F(D) \in \mathbb{H}_n$ is isometric from $(\mathbb{P}_n, \delta_{\phi,|||\cdot|||})$
into $(\mathbb{H}_n, |||\cdot|||)$ if and only if $F$ is in the form (2.1) and $M = M_\theta$ (so $\phi = \phi_\theta$). Moreover, for
every $A, B \in \mathbb{P}_n$,

\[
\delta_{\phi_\theta,|||\cdot|||}(A,B) =
\begin{cases}
\dfrac{2}{|2-\theta|}\, |||A^{\frac{2-\theta}{2}} - B^{\frac{2-\theta}{2}}||| & \text{if } \theta \ne 2, \\[2mm]
|||\log A - \log B||| & \text{if } \theta = 2,
\end{cases}
\]

and this distance is attained by curve (2.3).

The next comparison theorem is a partial extension of Theorem 4.1. An essential point of
the proof is similar to that of [14, Theorem 1.1].

Proposition 5.2. Let $M^{(1)}, M^{(2)} \in \mathfrak{M}_0$, $\theta \in \mathbb{R}$ and $\phi^{(k)}(x,y) := M^{(k)}(x,y)^\theta$, $k = 1, 2$. Then
the following conditions are equivalent:

(i) $(M^{(1)}(e^t,1)/M^{(2)}(e^t,1))^{\theta/2}$ is a positive definite function on $\mathbb{R}$;

(ii) $L_{\phi^{(1)},|||\cdot|||}(\gamma) \ge L_{\phi^{(2)},|||\cdot|||}(\gamma)$ for any curve $\gamma$ in $\mathbb{P}_n$ and for any unitarily invariant norm
$|||\cdot|||$;

(iii) $L_{\phi^{(1)},\|\cdot\|_\infty}(\gamma) \ge L_{\phi^{(2)},\|\cdot\|_\infty}(\gamma)$ for any curve $\gamma$ in $\mathbb{P}_n$ and for the operator norm $\|\cdot\|_\infty$.

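Proof. (i) ⟹ (ii). It suffices to show that (i) implies that

\[
|||\phi^{(1)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H||| \ge |||\phi^{(2)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H||| \tag{5.1}
\]

for all $D \in \mathbb{P}_n$ and $H \in \mathbb{H}_n$. To do this, one may assume that $D = \mathrm{Diag}(\lambda_1, \dots, \lambda_n)$. By (1.2)
notice that

\[
\phi^{(2)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H
= \left[ \left( \frac{\phi^{(1)}(\lambda_i,\lambda_j)}{\phi^{(2)}(\lambda_i,\lambda_j)} \right)^{1/2} \right]_{ij} \circ \bigl( \phi^{(1)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H \bigr)
\]

and

\[
\left( \frac{\phi^{(1)}(\lambda_i,\lambda_j)}{\phi^{(2)}(\lambda_i,\lambda_j)} \right)^{1/2}
= \left( \frac{M^{(1)}(\lambda_i/\lambda_j, 1)}{M^{(2)}(\lambda_i/\lambda_j, 1)} \right)^{\theta/2}. \tag{5.2}
\]

Since (i) implies that $\bigl[(\phi^{(1)}(\lambda_i,\lambda_j)/\phi^{(2)}(\lambda_i,\lambda_j))^{1/2}\bigr]_{ij}$ is a positive semidefinite matrix with all
diagonal entries equal to 1, (5.1) is obtained (see [8, 1.4.1] for example).

(ii) ⟹ (iii) is trivial.

(iii) ⟹ (i). For $k = 1, 2$, since $\phi^{(k)}$ is continuous, it is obvious that

\[
\lim_{\varepsilon \searrow 0} \frac{L_{\phi^{(k)},\|\cdot\|_\infty}([D, D + \varepsilon H])}{\varepsilon} = \|\phi^{(k)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_\infty
\]

for all $D \in \mathbb{P}_n$ and $H \in \mathbb{H}_n$, where $[D, D + \varepsilon H]$ denotes the straight segment $D + t\varepsilon H$, $0 \le t \le 1$.
Hence condition (iii) implies that

\[
\|\phi^{(1)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_\infty \ge \|\phi^{(2)}(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_\infty, \qquad D \in \mathbb{P}_n,\ H \in \mathbb{H}_n.
\]

When $D = \mathrm{Diag}(\lambda_1, \dots, \lambda_n)$, this means that

\[
\|H\|_\infty \ge \left\| \left[ \left( \frac{\phi^{(1)}(\lambda_i,\lambda_j)}{\phi^{(2)}(\lambda_i,\lambda_j)} \right)^{1/2} \right]_{ij} \circ H \right\|_\infty, \qquad H \in \mathbb{H}_n.
\]

Now the proof of [14, Theorem 1.1] shows that $\bigl[(\phi^{(1)}(\lambda_i,\lambda_j)/\phi^{(2)}(\lambda_i,\lambda_j))^{1/2}\bigr]_{ij}$ is positive
semidefinite, which means (i) thanks to (5.2). □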
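Remark 5.3. The geodesic distance versions of the above (ii) and (iii) are

(iv) $\delta_{\phi^{(1)},|||\cdot|||}(A,B) \ge \delta_{\phi^{(2)},|||\cdot|||}(A,B)$ for all $A, B \in \mathbb{P}_n$ and for any unitarily invariant norm
$|||\cdot|||$;

(v) $\delta_{\phi^{(1)},\|\cdot\|_\infty}(A,B) \ge \delta_{\phi^{(2)},\|\cdot\|_\infty}(A,B)$ for all $A, B \in \mathbb{P}_n$.

Obviously, (ii) ⟹ (iv) and (iii) ⟹ (v). It may be expected that (iv) and (v) are also equivalent
to the conditions of Proposition 5.2. This would be proved as in the proof of (iii) ⟹ (i) if we
have

\[
\lim_{\varepsilon \searrow 0} \frac{\delta_{\phi,\|\cdot\|_\infty}(D, D + \varepsilon H)}{\varepsilon} = \|\phi(\mathbf{L}_D, \mathbf{R}_D)^{-1/2} H\|_\infty
\]

for all $D \in \mathbb{P}_n$, $H \in \mathbb{H}_n$ and for $\phi = M^\theta$ with $M \in \mathfrak{M}_0$. Although the above convergence for
$\|\cdot\|_{HS}$ is Lemma 4.2, we do not know whether it is also true for $\|\cdot\|_\infty$.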
For $M^{(1)}, M^{(2)} \in \mathfrak{M}_0$ consider the following conditions:

(a) $M^{(1)}(x,1) \le M^{(2)}(x,1)$ for all $x > 0$;

(b) $M^{(1)}(e^t,1)/M^{(2)}(e^t,1)$ is positive definite on $\mathbb{R}$ (in this case we write $M^{(1)} \preceq M^{(2)}$);

(c) $M^{(1)}(e^t,1)/M^{(2)}(e^t,1)$ is infinitely divisible in the sense that $(M^{(1)}(e^t,1)/M^{(2)}(e^t,1))^r$ is
positive definite on $\mathbb{R}$ for any $r > 0$ (in this case we write $M^{(1)} \preceq_\infty M^{(2)}$).

Obviously, (c) ⟹ (b) ⟹ (a). Condition (a) appeared in Theorem 4.1 while (b) is in the
case $\theta = 2$ of Proposition 5.2. We also note that (b) played an essential role in [11], [15]. It was
recently observed in [10], [19] that the stronger condition (c) is even satisfied for many cases
where $M^{(1)}, M^{(2)} \in \mathfrak{M}_0$ satisfy (b). In fact, Kosaki [21] communicated to us that

\[
M_H \preceq_\infty M_G \preceq_\infty M_L \preceq_\infty M_{\surd} \preceq_\infty M_A
\]

can be easily shown by applying [19, Corollary 3] and [10, Proposition 4]. Hence by Proposition
5.2 (also Remark 5.3), if $\theta > 0$ then

\[
\delta_{M_H^\theta,|||\cdot|||}(A,B) \ge \delta_{M_G^\theta,|||\cdot|||}(A,B) \ge \delta_{M_L^\theta,|||\cdot|||}(A,B) \ge \delta_{M_{\surd}^\theta,|||\cdot|||}(A,B) \ge \delta_{M_A^\theta,|||\cdot|||}(A,B),
\]

and inequalities are reversed if $\theta < 0$. For $\{N_\alpha\}_{0<\alpha\le 2}$ given in (3.1), if $0 < \alpha < \beta \le 2$ then we
have $N_\beta \preceq_\infty N_\alpha$ by [10, Theorem 2] since $N_\beta(e^{2t},1)/N_\alpha(e^{2t},1) = (\beta/\alpha)(\sinh \alpha t/\sinh \beta t)$. As
for $\psi_\alpha = N_\alpha^2$, similarly to Theorem 3.3 we have

\[
\delta_{\psi_\alpha,|||\cdot|||}(A,B) = |||\log(A^{-\alpha/2} B^\alpha A^{-\alpha/2})^{1/\alpha}|||, \qquad 0 < \alpha \le 2,
\]

which decreases to $\delta_{M_L^2,|||\cdot|||}(A,B) = |||\log A - \log B|||$ as $\alpha \searrow 0$ (this is also a consequence of
Araki's log-majorization as mentioned at the end of Section 3). In particular, the inequality

\[
\delta_{M_G^2,|||\cdot|||}(A,B) = |||\log(A^{-1/2} B A^{-1/2})||| \ge |||\log A - \log B|||
\]

is the generalized EMI in [7].
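The ratio identity for $N_\beta/N_\alpha$ and the positive definiteness in condition (b) can be probed numerically. The sketch below again assumes the closed form $N_\alpha(x,y) = \alpha(xy)^{\alpha/2}(x-y)/(x^\alpha-y^\alpha)$ for (3.1), which is not restated in this section; the sample points and the Cholesky-based test `is_positive_definite` are illustrative choices, and a finite sample can of course only support, not prove, positive definiteness.

```python
import math

def n_alpha_ratio(t, alpha, beta):
    """N_beta(e^{2t}, 1) / N_alpha(e^{2t}, 1) from the assumed closed form (t != 0)."""
    x = math.exp(2 * t)
    n = lambda a: a * x ** (a / 2) * (x - 1) / (x ** a - 1)
    return n(beta) / n(alpha)

def sinh_ratio(t, alpha, beta):
    """(beta/alpha) * sinh(alpha t) / sinh(beta t), extended by its limit 1 at t = 0."""
    return 1.0 if t == 0 else (beta / alpha) * math.sinh(alpha * t) / math.sinh(beta * t)

def is_positive_definite(m):
    """Cholesky test for a real symmetric matrix given as a list of rows."""
    n = len(m)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            acc = m[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                if acc <= 0:
                    return False
                low[i][j] = math.sqrt(acc)
            else:
                low[i][j] = acc / low[j][j]
    return True

alpha, beta = 0.5, 1.5
points = [-1.0, -0.3, 0.4, 1.2]
gram = [[sinh_ratio(s - t, alpha, beta) for t in points] for s in points]
print(n_alpha_ratio(0.7, alpha, beta), sinh_ratio(0.7, alpha, beta))
print(is_positive_definite(gram))
```

Swapping the roles of $\alpha$ and $\beta$ produces values larger than 1 away from $t = 0$, so the corresponding matrices fail the test, in accordance with the one-sided ordering $N_\beta \preceq N_\alpha$ for $\alpha < \beta$.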
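Finally, as for $\phi_\theta$ we show:

Proposition 5.4. Let $|||\cdot|||$ be any unitarily invariant norm and $A, B \in \mathbb{P}_n$. Then $\delta_{\phi_\theta,|||\cdot|||}(A,B)$
(see Proposition 5.1) is decreasing in $\theta \in (-\infty, 2]$ and increasing in $\theta \in [2, \infty)$. Furthermore,

\[
\delta_{M_G^\theta,|||\cdot|||}(A,B)
\begin{cases}
\le \delta_{\phi_\theta,|||\cdot|||}(A,B) & \text{if } \theta \le 0,\ \theta \ge 4, \\
\ge \delta_{\phi_\theta,|||\cdot|||}(A,B) & \text{if } 0 \le \theta \le 4.
\end{cases}
\]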



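Proof. Assume that $\theta' < \theta \le 2$ or $2 \le \theta < \theta'$, and define a kernel function $k : (0,\infty) \times (0,\infty) \to
(0,\infty)$ by

\[
k(x,y) := \frac{2-\theta'}{2-\theta} \cdot \frac{x^{\frac{2-\theta}{2}} - y^{\frac{2-\theta}{2}}}{x^{\frac{2-\theta'}{2}} - y^{\frac{2-\theta'}{2}}}.
\]

The kernel $k(x,y)$ is positive definite (even infinitely divisible) by [10, Theorem 2] and $k(x,x) =
1$ for all $x > 0$. With the diagonalizations $A = U\,\mathrm{Diag}(\lambda_1, \dots, \lambda_n)U^*$ and $B = V\,\mathrm{Diag}(\mu_1, \dots, \mu_n)V^*$
we write

\[
\begin{aligned}
\frac{2}{2-\theta} \bigl( A^{\frac{2-\theta}{2}} - B^{\frac{2-\theta}{2}} \bigr)
&= U \left( \left[ \frac{2}{2-\theta} \bigl( \lambda_i^{\frac{2-\theta}{2}} - \mu_j^{\frac{2-\theta}{2}} \bigr) \right]_{ij} \circ (U^*V) \right) V^* \\
&= U \left( \bigl[ k(\lambda_i, \mu_j) \bigr]_{ij} \circ \left[ \frac{2}{2-\theta'} \bigl( \lambda_i^{\frac{2-\theta'}{2}} - \mu_j^{\frac{2-\theta'}{2}} \bigr) \right]_{ij} \circ (U^*V) \right) V^*.
\end{aligned}
\]

Hence [8, 1.4.1] can be applied to obtain $\delta_{\phi_\theta,|||\cdot|||}(A,B) \le \delta_{\phi_{\theta'},|||\cdot|||}(A,B)$ thanks to Proposition
5.1.

The second assertion (extending (4) of Example 4.7) follows since $M_\theta \preceq_\infty M_G$ for $\theta \ge 4$ and
$M_G \preceq_\infty M_\theta$ for $\theta \le 4$ (see [20, §2.6]). □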